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Abstract 



This thesis studies three topics in quantum computation and information: (1) The 
approximability of "inherently quantum" problems, (2) quantum proof systems, and (3) 
non-classical correlations in quantum systems. Our results in each area are summarized as 
follows. 

Our first area of study concerns the approximability of computational problems which 
are complete for quantum complexity classes. In the classical setting, the study of approx- 
imation algorithms and hardness of approximation is one of the main research areas of 
theoretical computer science. Yet, little is known regarding approximability in the setting 
of quantum computational complexity. Our first result (joint work with Julia Kempe) is 
a polynomial-time approximation algorithm for dense instances of the canonical QMA- 
complete quantum constraint satisfaction problem, the local Hamiltonian problem. Our 
second result (joint work with Julia Kempe) goes in the opposite direction by first in- 
troducing a quantum generalization of the polynomial-time hierarchy. We then introduce 
problems which are not only complete for the second level of this hierarchy, but are in fact 
hard to approximate. 

Our second area of study concerns quantum proof systems. Here, an interesting ques- 
tion which remains open despite much effort is whether a proof system with multiple 
unentangled quantum provers is equal in expressive power to a proof system with a single 
quantum prover (i.e. is QMA(poly) equal to QMA?). Our results here (joint work with 
Jamie Sikora and Sarvagya Upadhyay) study variants of this question. We first show that 
if each unentangled prover has logarithmic size proofs, then this is equivalent to having 
a single quantum prover which sends a classical proof. We then show that a variant of 
the class BellQMA(poly) collapses to QMA. Finally, we give an alternate proof of the fact 
[Harrow and Montanaro, FOCS, p. 633-642 (2010)] that the class SepQMA(m) (which is 
equivalent to QMA(m)) admits perfect parallel repetition. Our alternate proof is novel in 
that it is based on cone programming duality. 

Our final area of study concerns non-classical correlations in quantum systems. Specif- 
ically, in recent years it has come to light that there appear to be genuinely quantum 
correlations in mixed quantum states beyond entanglement which may nevertheless prove 
useful from a computing and information theoretic perspective. Our first result in this area 
(joint work with Animesh Datta) motivates the study of such correlations by exploring pos- 
sible connections to the quantum task of locking of classical correlations [DiVincenzo et al, 
PRL 92, 067902 (2004)] and the DQC1 model of mixed-state quantum computing [Knill 
and Laflamme, PRL 81, 5672 (1998)]. Our second result in this area introduces a novel 
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scheme for quantifying non-classical correlations based on the use of local unitary opera- 
tions. Our third result (joint work with Marco Piani, Gerardo Adesso, John Calsamiglia, 
Pawel Horodecki, and Andreas Winter) introduces and studies a protocol through which 
non-classical correlations in a starting system can be "activated" into distillable entangle- 
ment with an ancilla system. Surprisingly, we find that, according to the non-classicality 
measures derived from our protocol, mixed entangled states can be "more non-classical" 
than pure entangled states. Finally, our last result (joint work with Marco Piani, Ger- 
ardo Adesso, John Calsamiglia and Pawel Horodecki) continues the study of the activation 
protocol above by determining when the entanglement generated with the ancilla can be 
mapped back onto the starting state via entanglement swapping. 
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Chapter 1 
Introduction 



The "paradox" is only a conflict between reality and your feeling of what reality 
"ought to be." — Richard Feynman, 1964 [95]. 

From its earliest days, the theory of quantum mechanics puzzled its inventors. In 1935, 
for example, Einstein, Podolsky, and Rosen published their now famous paper rejecting 
quantum mechanics as a complete physical theory [89]. The problem? The mathematical 
theory of quantum mechanics predicts certain physical phenomena which are completely 
at odds with our everyday understanding of the world around us. To put this into ev- 
eryday language, in 1935 Schrodinger proposed [221] a thought experiment now known as 
Schrddinger's cat, in which under certain circumstances, a cat in a closed box is predicted 
by quantum mechanics to be both alive and dead, at the same time. What could this 
mean? And how much did it trouble the discoverers of quantum mechanics, if it led them 
to ask questions such as: 

/ recall that during one walk Einstein suddenly stopped, turned to me and asked 
whether I really believed that the moon exists only when I look at it. 
— Abraham Pais [5]. 

Clearly, quantum mechanics was not an easy pill to swallow, even for the fathers of the 
theory, many of whom rejected their beautiful child at the time. 

Fast forwarding to the end of the 20th century, however, physicists and computer 
scientists came to a startling realization: As strange as quantum mechanics may seem, 
if its peculiarities could somehow be computationally harnessed, then the possibility of 
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outperforming classical computers with so-called quantum computers may indeed exist. 
In 1982, for example, physicist Richard Feynman proposed [96] the notion of building a 
quantum computer in order to simulate physical quantum systems faster then apparently 
possible with a classical computer (see also Benioff [43, 44, 45]). On the computer science 
side, in 1985 David Deutsch demonstrated a quantum algorithm which outperformed the 
best possible classical deterministic algorithms for what is now referred to as Deutsch 's 
problem [84]. Thus, the roots of the field of quantum computation were sown. Two and a 
half decades later, we now have a number of good reasons for seriously devoting research 
effort to the field of quantum computing, which we now discuss. 

Relevance. We now state three reasons which, in our opinion, justify the study of quan- 
tum computation and information. The first is from an engineering-oriented perspective. 
Up until 2005, the speed of microprocessors increased rapidly, primarily through the brute 
force approach of increasing the number of transistors able to fit on a single microchip. 
Indeed, Intel's original Pentium P5 processor, released in 1993, had a clock speed of 60 
MHz, and consisted of 3.1 million transistors [1]. By 2005, Intel's Pentium 4E Prescott 
processor was up to 3.8 GHz, and packed in a whopping 169 million transistors. Yet, in 
2005, something curious happened: Intel introduced its first dual-core chip, the Pentium 
D Smithfield, which clocked in not at 3.8 GHz, but at a slower 3.2 Ghz. What happened? 
It turns out that the brute force approach to building faster processors has a number of 
seemingly fundamental problems, such as excess heat production and energy loss [3] ; how- 
ever, the primary problem of interest in this thesis is that at the scale current microchip 
components are approaching, the pertinent laws of physics are no longer those of classical 
mechanics, but rather those of quantum mechanics [2]. This raises the natural question: 
Why not just build a computer which works based on the laws of quantum mechanics to 
begin with, i.e. a quantum computer? 

The second motivation for studying quantum computing, and perhaps the most com- 
monly cited one, came with a startling discovery: Peter Shor's quantum factoring algorithm 
of 1994 [22 1]. As whether the question of whether factoring large integers can be done effi- 
ciently on a classical computer has long been open, Shor's algorithm is in itself arguably a 
strong indication that the quantum computational model is indeed one deserving of study. 
Further, since the algorithm's inception, a number of other instances of quantum speedup 
have been uncovered, from Grover's algorithm for unstructured search [122] (which yields 
a square root speedup for NP-complete problems over the brute force approach) to the 
evaluation of NAND trees [92, 24, 66] to estimating quantities related to solving systems 
of linear equations [129], among others. 

The reasons stated thus far, however, are rather "selfish", aiming to exploit quantum 
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mechanics to serve the purpose of the computer science community. There is another view 
regarding the study of quantum computing which follows the converse mantra: Ask not 
what quantum mechanics can do for you, but what you can do for quantum mechanics. 
Indeed, as computation is inherently physical, it follows that understanding the limits of 
quantum computation yields new tools for studying the properties of quantum mechanics 
itself. A primary example of this, discussed further in Section 1.5.4, is that via quantum 
complexity theory, one can give a rigorous proof that a significant problem in quantum 
mechanics, that of estimating the ground state energy of a given local Hamiltonian, cannot 
be solved efficiently (modulo standard complexity theoretic conjectures). Thus, the third 
reason for studying quantum computation is that it not only allows us to learn about the 
limits of computing, but also of physics itself. Moreover, there has even been a pedagogical 
benefit to physics from quantum computing; apparently, there is a growing movement 
to replace the teaching of introductory quantum mechanics using, say, the model of the 
hydrogen atom, with the simpler model of quantum bits and quantum computation [ 
(see Chapter Notes and History for Chapter 10 therein). 

In closing, we have provided three motivations for studying quantum computing from 
engineering, computer science, and physics standpoints. In practice, however, it is of course 
not until a thorough study of quantum computing is undertaken that we will know the pre- 
cise extent to which the field will prove relevant, particularly from a practical technological 
perspective. Such uncertainty lies unfortunately (or fortunately, for the adventurous type) 
at the very heart of the nature of our work as researchers. In the words of one of our 
greats: 

If we knew what it was we were doing, it wouldn't be called 'research', would it? 
— Albert Einstein [5] . 

Focus of this thesis. The field of quantum computation and information nowadays cov- 
ers a broad expanse of topics, with research areas ranging from computer-science-motivated 
topics such as quantum algorithms and quantum proof systems, to engineering or exper- 
imental physics-oriented topics such as how to actually build a quantum computer in a 
lab, to theoretical-physics-motivated topics such as the limits of physical theories and the 
correlations between systems they allow. In this thesis, we focus on three particular areas 
of interest: Approximation of quantum problems, quantum proof systems, and quantum 
correlations. We briefly describe each area below. As each (research) chapter is intended 
to be as self-contained as possible, we defer more in-depth introductions to the beginning 
of each relevant chapter. 
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Our first area of interest is that of approximating quantum problems. Here, by a 
quantum problem, we are referring to a computational problem which is in some sense 
intrinsically related to physical quantum systems in nature. From a complexity theoretic 
perspective, we define such problems as those which are complete for quantum complexity 
classes. (Relevant quantum complexity classes are defined in Section 1.5.) In particular, 
the canonical quantum problem generalizing classical constraint satisfaction which we are 
interested in here is called the local Hamiltonian problem, and it is complete for a quantum 
generalization of NP. (This problem is important from both a quantum complexity theo- 
retic and physics point of view, and as such is given a thorough treatment in Section 1.6.) 
The primary aim of our research in this area is to ask how well such problems can be ap- 
proximated rigorously, in the well-studied classical sense of approximation algorithms and 
hardness of approximation [ ]. In the quantum complexity theoretic setting, this ap- 
proach to approximating physically relevant quantum problems is very much in its infancy, 
and it complements decades of effort by the physics community on similar problems using 
different tools involving heuristics (see e.g. [ 04] for a brief survey). Based on joint work 
with Julia Kempe, Chapters 2 and 3 discuss our results in this area, the first of which is 
a positive result regarding approximation algorithms for the local Hamiltonian problem, 
and the second of which is a negative result involving hardness of approximation for a new 
quantum complexity class generalizing the second level of the well-known polynomial-time 
hierarchy, E^. 

Our second area of interest deals with quantum proof systems. In the classical setting, 
proof systems are one of the cornerstones of complexity theory, with wide-ranging impact 
from the theory of NP-completeness [72, 179] to the stunning PCP theorem [30, 29] of the 
early 1990's. It is thus natural to consider studying quantum proof systems, beginning with 
a quantum generalization of NP called Quantum Merlin Arthur (QMA). However, just as 
quantum mechanics offers new quantum phenomena to be harnessed for the purpose of 
computation, such phenomena now play intriguing roles in quantum proof systems. In 
particular, their presence can turn trivial questions in the classical setting into highly non- 
trivial questions in the quantum setting. For example, in the classical setting, modifying 
NP to allow multiple provers is straightforwardly equivalent in expressive power to the 
original definition of NP, since a single prover can straightforwardly simulate multiple 
provers. However, the question of whether QMA with multiple provers is equal to QMA 
is very challenging, due to the possible presence of strong correlations between quantum 
systems known as quantum entanglement. In joint work with Jamie Sikora and Sarvagya 
Upadhyay, Chapter 4 studies variants of this stubbornly open question. 

Our final area of interest is the study of quantum correlations. As mentioned when 
discussing quantum proof systems above, a pair of quantum systems can display very 
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strong correlations known as entanglement, which is a purely quantum phenomenon; such 
correlations are not possible in the classical setting. As a testament to the mysterious 
nature of quantum mechanics, however, after nearly a century of study, it has only been 
in recent years that a new type of purely quantum correlation has been identified, known 
simply as non- classical correlations. Some of the biggest questions in this area are how to 
quantify and provide operational interpretations for such correlations, as well as to under- 
stand whether and how they may be exploited for computational gain. In joint work with 
Animesh Datta, Chapter 5 studies the role of such correlations in quantum computation. 
Chapter 6 then proposes and studies a novel approach for quantifying such non-classical 
correlations. Finally, Chapters 7 (joint work with Marco Piani, Gerardo Adesso, John 
Calsamiglia, Pawel Horodecki, and Andreas Winter) and 8 (joint work with Marco Piani, 
Gerardo Adesso, John Calsamiglia, and Pawel Horodecki) introduce and study a new proto- 
col which provides an operational interpretation for non-classical correlations by activating 
them into entanglement. 



1.1 Organization 

This thesis is organized as follows. In the remainder of this section, we provide background 
on the basics of quantum computation and information (Section 1.4), and follow with brief 
technical expositions of the various topics studied in this thesis: Quantum computational 
complexity theory (Section 1.5) and quantum entanglement and non-classical correlations 
(Section 1.6). 

The remaining chapters are focused as follows. Chapters 2 and 3 study the approxima- 
bility of quantum complexity theoretic problems, such as the local Hamiltonian problem 
and its variants. Specifically, Chapter 2 presents our approximation algorithm for the lo- 
cal Hamiltonian problem. Chapter 3 then introduces our quantum generalization of E|, 
and shows completeness and hardness of approximation for it with respect to new local 
Hamiltonian-like quantum covering problems we define. 

Chapter 4 discusses our results regarding multi-prover quantum proof systems, showing 
that in a certain setting, multiple quantum provers are no more powerful than a single 
prover. 

Chapters 5, 6, 7, 8 discuss non-classical correlations in quantum systems beyond entan- 
glement. Specifically, Chapter 5 first motivates this direction of work by studying models 
of quantum computing and communication where entanglement does not seem to explain 
the advantage gained in the quantum setting over classical computation. Chapter 6 then 
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presents a novel approach for quantifying non-classical correlations in quantum systems 
based on local unitary operations. Chapter 7 gives an operational interpretation to such 
non-classical correlations by demonstrating an explicit protocol through which such corre- 
lations can be "activated" into entanglement. Chapter 8 further studies and attempts to 
extend the framework of the activation protocol of Chapter 7. 

We now begin in Section 1.2 by collecting common notation used throughout this thesis. 

1.2 Notation 

The following notation is assumed throughout this thesis. The symbols C, IR, Z, and 
N denote the sets of complex, real, integer, and natural numbers, respectively. For m a 
positive integer, the notation [m] indicates the set {l,...,m}. The terms C(X), 'H(X), 
Pos {X), and T>(X) denote the sets of linear, Hermitian, positive semidefmite, and density 
operators acting on complex Euclidean space X, respectively. The projector onto space X 
is denoted Tlx- We sometimes use the shorthand B := C 2 . The notation A y B means 
operator A — B is positive semidefmite. The smallest (largest) eigenvalue of A G H(X) 
is given by X min (A) (\ max (A)). The trace, Frobenius, and spectral (or operator) norms of 
A G £(X) are defined as 

|| A |L := Tr (VAIA) , \\A\\ F := y/Tr(AU), MIL- max || 2 , 

" V / \x)&x s.t. IMj 2 =i 

(1.1) 

respectively, where := denotes a definition. The (m, n)th entry of matrix A is given by 
A(m, n). We define the encoding or description of a matrix A as a classical descrip- 
tion of the entries of A. Specifically, let (A) denote the number of bits used to encode 
A G C to some desired precision. Then, we define the length of the encoding of A by 
(A) := Y2mn (A(m,n)). We extend this straightforwardly to sums of matrices; for exam- 
ple, (X^A) = Si (Ai)' The notation v denotes a vector. Unless otherwise noted, all 
logarithms are taken to base two. We sometimes use the shorthand poly(n) to mean p(n) 
for some fixed polynomial p. 

1.3 Linear algebra 

We now briefly review basic concepts from linear algebra crucial to the content of this 
thesis. Parts of this section follow the course notes of Watrous [246, 245]; the reader is 
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also referred to the text of Horn and Johnson [143] for further details. Those familiar with 
basic linear algebra can safely skim over this section or refer to it as needed. 



Complex Euclidean spaces. The setting in which all the excitement takes place is 
that of a complex Euclidean space X, defined as follows. Let E be a finite, non-empty set. 
Consider the set of all functions from E to the complex numbers C, denoted C s . Then, 
define for any u, v G C s and a G C the addition and scalar multiplication operations in 
the standard way: The addition u + v G C s obeys (u + v)(i) = u(i) + v(z) for all i G E, 
and scalar multiplication au G C s obeys (au)(i) = au(i) for all i 6 E. Then, the set C E 
along with these operations is known as a complex Euclidean space, which we denote as X. 
The dimension of X is given by |E|, the cardinality of S. For concreteness, we henceforth 
assume £ = [d] for [d] := {1, 2, ... , d}, and use the simplified notation C s = C d . 

We think of (column) vectors v G X as d-tuples, i.e. 



for v(i) G C. In quantum computation, v is commonly denoted using \v). Here, |-) is called 
Dirac notation, also sometimes affectionately known as "dog-houses" for vectors [243]. A 
remark about vector notation: Generally, our choice of notation v or \v) will be dictated 
by context. For example, when a vector is to be interpreted as a quantum state, we shall 
use Dirac notation \v); otherwise, we typically revert to the notation v. An exception 
to this rule, even in purely linearly algebraic contexts, is when it is more convenient to 
use Dirac notation, such as when vectors are to be labeled by complicated expressions. 
In much of the introductory discussion on linear algebra that follows, we assume v = \v) 
holds for the pedagogic purpose of familiarizing the reader with Dirac notation. However, 
in general this equality is not assumed to hold; for example, the zero vector is not equal 
to |0) = (1,0) T . We hope the distinction will be clear from context. 

Continuing, the conjugate transpose of v is denoted v^, or (v\ in Dirac notation, and 
is the row vector 



v = 



(1.2) 





for a the complex conjugate of a G C. 
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Vector norms. For any two vectors v, w e X, we define their inner product as 

d 

(v, w) = v*w = (v\w) = v(i)w(i). (1.4) 



i=l 



V 



Then, we measure the length ofv G C d via the Euclidean norm, defined as || v || 2 = a/ (v, v). 
The Euclidean norm is just one of an entire class of norms known as p-norms, defined for 
p G [1, oo) such that 

X>«r) , (1.5) 

and for p = oo as || v := (maxjg^ |i>(i)|) . Note that setting p = 2 yields the Euclidean 
norm. The p-norms have the following properties: 

1. (Positive scalability) || av || = \a\ || v || for a e C. 

2. (Triangle inequality) For any v,w E X, || v + w || < || v || + || w || . 

3. For v G A", if || v || = 0, then v = 0, where denotes the zero vector whose entries 
are all zero. 

From the first two properties, we conclude that for all v e X, \\ v || > 0, since 

= |0| || || p = || • || p = || || p = || v - v || p < || v || p + || -v || p < 2 || v || p . (1.6) 

A useful inequality regarding inner products is the Holder inequality, which states that 
for any v, w e X, 

|(v,w)| < ||v|| p ||w|| 9 (1.7) 

for ^ + ^ = 1. (For p — 1, q — oo.) When p = q = 2, we recover the Cauchy-Schwarz 

inequality. As a testament to the applicability of the latter, we show that || v || : < \fd || v || 2 , 
a frequently useful inequality. Let j be the d- dimensional all-ones vector and |v| the entry- 
wise absolute value of v. Then: 

ii«iii = aM><iaivi>i<iijii 2 iiivin 2 = Vd|iv|| 2 . (i.8) 

It also holds that || v || 2 < y/d \\ v j^, and conversely that || v 1^ > || v || 2 > || v j^. 
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Orthonormal bases. A set of vectors {vj} C X is orthogonal if for all i 7^ j, (vj, Wj) = 
0, and orthonormal if (vj, w 3 -) = 5ij. Here, is the Kroenecker delta, whose value is 
1 if i — j and otherwise. Every complex Euclidean space X of dimension d has an 
orthonormal basis consisting of d elements, where a basis is a set of vectors {vj} C X such 
that any w G X can be expressed as 

d 

w = ^a iVi (1.9) 
i=\ 

for some {ctj} C C. A common basis for X is the computational or standard basis {e^}, 
defined such that = In Dirac notation, we frequently denote this basis simply as 



Linear operators and matrices. Given two complex Euclidean spaces X and y, a 
linear operator or linear map from X to y is a map $ : A' 1— >■ y with the property that 

$ (X^ v ^ =I>i$(vi), ( L1 °) 

where {vi} C X . The set of all such linear maps from X to 3^ is denoted C(X, y), which 
when coupled with operations for addition and scalar multiplication in the standard way, 
yields a vector space of dimension dim(A') dim(^). Here, dim(A') is the dimension of X. 
For brevity, we use the shorthand C(X) to mean C(X, X). 

A convenient way to represent and study linear maps is via their matrix representation. 
Here, an m x n matrix A is a two-dimensional array of complex numbers whose (2, j)th 
entry is denoted A(i, j) e C for i e [to], j G [n\. To represent a linear map $ : C n (->• C m 
as an to x n matrix recall that the action of a map is completely specified by its action 
on a basis. Specifically, the ith column of A§ is given by $(ej) for {e^} the standard basis 
for C n , or 

A„= [$( ei ),$(e 2 ),...,$(e m ) ]. (1.11) 

Recovering $ from A$ thus also follows immediately from this view. When we henceforth 
discuss A G £(X), we are implicitly referring to the matrix representation of map A. 

The product AB of two d x d matrices A and B is defined such that 

AB(i,j) = (v?,cf) (1.12) 
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for rf the zth row of A and the jth column of B. In general, it is not true that 
AB = BA. The difference AB — BA is called the commutator [A, B] of A and B, and the 
anti- commutator is {A, B} = AB + BA. 

The rank of A G 3^) is the dimension of its image, where the latter is defined as 
Im(A) := {y G y | y = Ax for some x G A 1 }. The rank satisfies 

rank(AB) < min {rank(A), rank(£)}. (1.13) 

Defining the null space or kernel of A G as Ker(A) := {v G A" Av = 0}, it holds 

that dim(Ker(A)) + dim(Im(A)) = d. 

Eigenvalues and eigenvectors. For any A G C(X), we say v is an eigenvector of A with 
eigenvalue A if v ^ and Av = AA The multiset of eigenvalues of A (with multiplicity) is 
known as its spectrum. The eigenvalues of A arise as the roots of the degree-<i characteristic 
polynomial of A, pa, defined such that 

p A (x) := det(xl- A), (1.14) 

where I(i,j) '■— d~ij is the Identity matrix and det is the determinant. One way to define 
the latter, known as the Laplace expansion, is via the recursive definition 

d 

det(A) = J2(-V i+J Mh3)det(A tJ ). (1.15) 

Here, Ay is the matrix obtained from A by deleting row i and column j, and we define the 
base case of this recursion (i.e. a 1 x 1 matrix [c]) as det([c]) = c. This equation holds for 
any i G [d]. 

Matrix operations. A number of operations on matrices A G X arise repeatedly in 
quantum computing. First, the complex conjugate, transpose and adjoint operations are 
respectively defined via 

A(i,j) := {MsW) A T {i, 3 ) := A(j,i) ^ := (Af '. (1.16) 

These operations apply to vectors as well so that (v\, defined in Equation (1.3), is simply 

The trace of A is a linear function defined as Tr(v4) := J2 i=l A(i,i) = ^2 i= iK{A), 
where {A,(v4)} C C are the eigenvalues of A Henceforth, when clear from context, we 
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simply write A« for the latter. The trace has the useful property of being cyclic, i.e. 
Tr(ABC) = Tr(CAB). With the trace in hand, we can define an inner product on C(X) 
as (A, B) = Tr(A*B). 

The tensor product is an important operation through which joint quantum systems 
can be described. Specifically, for complex Euclidean spaces X and y, their tensor product 
is X <g) y — C dxXdy . For vectors u G X and y G y, we define for all i E [d x ] and j G [d y ] 

{vL®\)(i,j):=u(i)v(j). (1.17) 

For linear operators A G C(X), B G £(3-0, A® B yields a complex matrix whose index 
sets are given by ([d x ] x [dy], [d x ] x [dy]), such that 

(A ® ji), (i 2 , j 2 )) := Afa^BO'i,^) (1.18) 

for all zi, 22 G [d x ] and ji,j2 G [dj,]. The tensor product has the following properties for 
any A,C E X, B, D E y, c E C: 

(A + C)®B = A®B + C®B (1.19) 

A®(B + D) = A® B + A® D (1.20) 

c(A®B) = (cA)®B = A®(cB) (1.21) 

(A(8)S)(C®D) = AC(g)BD (1.22) 

Tr(A®5) = Tr(A)Tr(fi) (1.23) 

(A ® 5) 1 " = ^(8) St. (1.24) 

These properties hold analogously in the vector setting. 

Given the composition of two spaces X and 3^ via the tensor product, we also require an 
operation in the reverse direction for removing one of these spaces. For this, we define the 
linear partial trace map. Specifically, for A®B E £(X<g>y), the partial trace Trx(A<S>B) E 
y is defined as 

Tr x (A <g> B) := Tr(A)B. (1.25) 
Alternatively, for any orthonormal basis {vj}^ =1 for X, we can write for A E C(X <g> y) 

d 

Tr x (A) = (vj ®/)A(vi<g> I). (1.26) 
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Special classes of operators. A few classes of linear operators play important roles in 
quantum computing. The first of these is the class of Hermitian operators H(X) C C(X), 
defined as the set of A G C(X) satisfying = A. As the set of Hermitian operators is 
closed under addition and real scalar multiplication, and since (A, B) G K for all A, B E 
%{X), it follows that H(X) forms a real inner product space of dimension d 2 . 

The eigenvalues of Hermitian operators are real. If the eigenvalues of Hermitian A are 
in {0, 1}, then equivalently A 2 = A, and A is called an (orthogonal) projection. (Non- 
Hermitian A satisfying A 2 = A are called oblique projections, and are not used here.) 

More generally, a Hermitian matrix A G T-L(X) whose eigenvalues are all non-negative 
is called positive semidefinite, denoted A >z (more generally, the notation A >z B means 
A — B y 0). Positive semidefinite matrices A G 'H(X) can equivalently be characterized 
as follows: 

• xUx > for all xGl 

• A = B^B for some B E C(X). 

The set of positive semidefinite operators acting on X is denoted Pos (X). 

Next, a unitary operator U E U(X) is defined as satisfying UW = WU = I. The 
eigenvalues of U are complex numbers of modulus 1. All unitary operators preserve the 
length of any vector v, i.e. (Uv,Uv) = (v, v). More generally, any U G C(X,y) with 
U^U = Ix is called an isometry. 

Hermitian, positive semidefinite, and unitary matrices are in fact all special cases of 
normal matrices A, defined such that AA< = A^A. Normal matrices are important due to 
the Spectral Decomposition theorem, which we discuss next. 

Matrix decompositions. An extremely useful property of normal matrices A acting on 
X is that they can be written in terms of their spectral decomposition, i.e. 

d 

A — ^2 W\){\\ — UDU\ (1.27) 
i=i 

where recall Aj are the eigenvalues of A, the set {|Aj)}^ =1 is a corresponding orthonormal set 
of eigenvectors of A, D — diag({Aj}) is a diagonal operator with entries D(i,i) = Aj, and 
U is a unitary matrix whose ith column is |Aj). Here we have switched to Dirac notation 
to highlight, in our opinion, one of its strengths — the ability to label vectors easily by 
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complicated expressions. Note that if Aj ^ Aj for all then the set of eigenvectors above 
is unique. 

A common problem in quantum mechanics is to analyze the spectrum of a sum of two 
matrices A, B G C(X). In general, this is a difficult problem. However, if the matrices are 
normal and they commute, i.e. [A, B] = 0, then this task is made easier by the fact that A 
and B must simultaneously diagonalize. In other words for normal A and B, [A, B] = if 
and only if there exists an orthonormal basis {\bi}} C X such that 

d d 
A = Y,HA)\bi)(bi\, B = Y,HB)\bi)(bi\. (1.28) 

i=l i=\ 

While the spectral decomposition holds only for normal matrices, a more general decom- 
position known as the singular value decomposition exists even for non-square matrices. 
The latter says that for any d y x d x matrix A G C(X, y), we have 

A = UDV ] (1.29) 

for d y x d y unitary U, d x x d x unitary V, and d y x d x diagonal matrix D whose entries 
D(i,i) are non-negative real numbers called the singular values of A. 



Operator functions. With the spectral decomposition in hand, we can now apply func- 
tions / : C H- C to normal operators A G X as follows. Let A have spectral decomposition 
A = Yli=i Then, assuming {Aj} is a subset of the domain of /, 

d 

f(A):=Y,f(^)(M- (1-30) 
i=i 

Three common functions / encountered in this thesis are f(x) = e x , f(x) = logx, and 
f(x) = y/x, the operator functions of which are denoted as e A , log A, and y/A, respectively. 
Here, the logarithm is taken to base two. 



Operator norms. Similar to the p-norms we defined for vectors, a useful class of norms 
for measuring the "length" or "magnitude" of a matrix are the Schatten p-norms. Their 
definition is simple: For any p G [l,oo], let a (A) denote the vector of singular values of 
A EX. Then, 

\\A\\ p :=\\a(A)\\ p . (1.31) 
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A particularly nice aspect of this definition is that for Hermitian operators, 0i(A) = \Xi(A)\. 
Moreover, properties of the vector p-norms carry over straightforwardly to the Schatten 
p-norms, such as the Holder inequality, positive scalability, and the triangle inequality. 

Some further important properties of the p-norms for any A G C(X) are: 

1. II A II =|| A || = || A T || , from which also II A \\ = \\ AU\ . 

" "p ii iip ii iip' n "p ii iip 

2. (Invariance under isometries) || UAV^ || = || A \\ p for any isometries U and V for 
which UAV^ is well-defined. 

3. II ABC II < || A|| ||B|| ||C|| . 

II Hp — II M oo II Hp II II OO 

4. (Submultiplicativity) || AB || < || A || || B || . This follows from Property 3. 

There are three specific values of p of interest here: p — 1, p — 2, and p = oo. They 
correspond to the trace, Frobenius, and spectral (or operator) norms, respectively, and can 
alternatively be defined as 

||A|| tr :=Tr(^AtA), M|| p := A/Tr(AtA), 11^:= max ||A|x)|| 2 . 

V ' \x)£X S.t. ||x|| 2 =l 

(1.32) 

The trace norm has two further properties of interest: First, it is non-increasing under the 
partial trace, meaning that for A G L{X ® y), \\ Tiy(A) || tr < || A || tr . Second, for unit 
vectors u, v G X we have 

|| uut - W || tr = 2 V / l-|(u,v)| 2 < 2 || u - v || 2 . (1.33) 

The second inequality follows by expanding the definition of the Euclidean norm and 
applying the identity 1 — x 2 < 2(1 — x). The first equality follows [246] by noting that 
A := uu^-vvt is Hermitian, and so its trace norm is a function of the absolute values of its 
eigenvalues, which we now analyze. Since rank(A) < 2 and Tr(A) = 0, its spectrum must 
be {A, —A, 0, . . . , 0} for some A G R. Thus, Tr(A 2 ) = 2A 2 . However, a direct evaluation of 
Tr(A 2 ) from the definition of A also reveals Tr(A 2 ) =2 — 2 |(u, v)| 2 . Combining these two 
expressions for Tr(A 2 ), the claim follows. 

Linear super-operators. We have discussed (linear) operators $ : X i— > X and $ : y \— > 
y. Moving a step up the ladder, we can also discuss linear operators $ : C(X) i— > C(y). 
Such maps are called linear super- operators. Bestowed with the standard definitions of 
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addition and scalar multiplication, the set of super-operators, denoted T(X,y), forms a 
linear space. The tensor product operation applies analogously to super-operators as it did 
to operators. 

The adjoint of super-operator $ G T(X,y), $* G T(y,X), is uniquely defined by the 
equation 

(A,Q(B)) = (&{A),B), (1.34) 
which holds for all B G and i G 

Special classes of super-operators. From a quantum computing perspective, we are 
most interested in super-operators which are trace-preserving and completely positive 
(TPCP). A trace-preserving super-operator $ G T(X,y) is defined as satisfying 

Tr(A) = Tr($(A)) (1.35) 

for any A G C(X). To define a completely positive map, we first define a positive map 
$ G T(Af,^) as satisfying b for any A G £(#) such that A ^ 0. Then, a map 

$ G T(rY, y) is called completely positive if Ic(x) ® $ is a positive map. Intuitively, a 
completely positive map $ sends positive semidefinite operators to positive semidefinite 
operators, even if $ acts on only part of a larger composite system. 

Matrix representations of super-operators. Just as we discussed a matrix represen- 
tation for linear operators, there are a number of useful matrix representations for linear 
super-operators. (See the notes of Watrous [247] for an excellent exposition.) Here, we 
discuss two particular representations used in this thesis, known as the Stinespring and 
Kraus representations. 

The Strinespring representation lends a nice interpretation to admissible quantum maps 
later. Specifically, it says that the action of any TPCP map $ G T(X,y) on arbitrary 
X G C(X) can be written as 

${X)=Tr 2 (AXAi), (1.36) 

for some complex Euclidean space Z and some linear isometry A G C(X, y®Z). Moreover, 
dim(Z) can be taken as dim(2) < dim(rY) dim(^). In the context of quantum computa- 
tion, it will be particularly useful to note that this is equivalent [21] to saying $(Jf) can 
be written as, for 3^ = = 3^2, 

$(X) = TW 2 [U(X X ® lOXOl^Jt/t] , (1.37) 
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for some unitary U EU(X Cg> ® 3^) • 

We now define the Kraus representation, which is sometimes also known as the operator- 
sum representation [200]. The Kraus representation says that any TPCP map $ E T(X, y) 
can be expressed in terms of a set of Kraus operators {Kj} i=1 C C(X, y) such that 

k 

<s>{x) = Y, K i XK l ( L38 ) 

1=1 

where Y^h=i KiKi = Ix and k < dhn(X) dim(^). 

1.4 Basics of quantum computation 

We now introduce the basics of quantum computation. For further details, the interested 
reader is referred to the texts of Nielsen and Chuang [ 0], Kitaev, Shen, and Vyalyi [171], 
and Kaye, Laflamme, and Mosca [162]. From a computer scientist's perspective, note that 
the primary background required is not quantum physics, but rather linear algebra [143]. 
This is because, just as with any (say) sports game, in order to play the game, you simply 
have to learn the rules of the game. Quantum mechanics, in particular, has four simple 
rules, and they are all based on linear algebra. These rules govern the following four 
intuitively logical concepts: How a quantum state is described, how does one "read" or 
measure a quantum state, what operations can be performed on a quantum state, and 
finally, how does one describe multiple quantum systems jointly. 

1.4.1 Describing quantum states 

Let X denote a complex Euclidean space. Then, in a nutshell, any p E Pos (X) with 
trace 1 describes a valid quantum state. Let us now provide some intuition as to how this 
statement comes about. 

In classical computing, the basic unit of information is a bit, which takes on values in the 
set {0, 1}. One can equivalently encode a bit using the set {|0), |1)}, where {|0), |1)} C C 2 
is the standard basis for C 2 , i.e. |0) = (1, 0) T and |1) = (0, 1) T . The key difference between 
classical bits and qubits is that in the quantum world, one can interpolate between the two 
discrete values |0) and |1) by taking a superposition, i.e. the vector 

|^) = a|0) + /3|l) (1.39) 
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describes a valid quantum state if \a\ + \/3\ = 1. In other words, any unit vector in C 2 
describes a quantum bit, or qubit. 

More generally, assume X has dimension d. Then, any unit vector G X describes 
a rf-dimensional quantum state, sometimes dubbed a que? it. Such vectors are called 'pure 
states, and do not yet capture the set of all possible (i-dimensional quantum states. To 
complete the picture, we simply allow probabilistic mixtures of such pure states, more 
generally referred to as mixed states. Such probabilistic mixtures are described in the 
following straightforward manner, known as the density matrix formalism. 

Associated with any probabilistic mixture is an ensemble, 

{MLvmmtx}, (i-4o) 

where {pi\ k i=l forms a probability distribution and (l^)} Q X is a set of unit vectors. The 
corresponding mixed quantum state p is thus: 

k 

P = ^2pM(i>i\- (i.4i) 

Here, p is called the density matrix describing the underlying quantum state. We denote 
the set of density operators acting on X as V(X). 

Let us now tie this back into the statement made at the beginning of this subsection. 
Note that since in Equation (1.41), p is a non-negative sum of positive semidefmite opera- 
tors, we must have p >z 0. Moreover, by applying the cyclic property of the trace, we have 
Tr(p) = 1, as claimed. Indeed, based on the exposition above, we can now intuitively see 
why any p G X with p >z and Tr(p) = 1 describes a valid quantum state — simply take 

the spectral decomposition of p to recover an ensemble |{pj}^ =1 , {|V'i}('0i|}i=i|- 

We remark that although here we have attempted to present a simple exposition of 
how quantum states are classically described, in reality the precise interpretation of what 
such a classical description means is highly non-trivial and continues to be debated after 
decades of research. 

1.4.2 Measuring quantum states 

Now that we have a mathematical description of quantum states, we require a formalism 
for modeling how a quantum state is "observed", or measured. For this, let p G V(X) 



17 



be a density matrix. Then, a quantum measurement is formalized by a set of operators 
IT : = {Mi} C £(#) satisfying 

^M/M = J, (1.42) 

where the latter is called the completeness relation. The act of measuring p with II is in 
general an inherently probabilistic process, even if p corresponds to a pure state (unlike 
in the classical case of bits). Specifically, when measuring p with respect to II, we obtain 
outcome i with probability given by 

Pr(outcome i\p) = Tr(M iP Mj). (1.43) 

Once a particular outcome i is observed, the state p "collapses" to a new state p' consistent 
with this outcome, i.e. 

= M,pMj 
Pr(outcome i\p) 

Note that the denominator above serves the role of renormalizing p' so that Tr(p') = 1. 

We have thus far described general measurements. Often, we are interested in the 
special case when each Mi is an orthogonal projection operator (not necessarily of rank 
one), such that MiMj = 5ijMi. Such measurements are called projective or von Neumann 
measurements. A common way to represent a projective measurement is via an observable 
M e Via the spectral decomposition, we can write M = AjlT, where A« ^ Aj 

for % 7^ j and each IT is a projection operator (of rank possibly greater than one). Then, 
each eigenvalue Aj corresponds to a distinct label for a measurement outcome, and the 
measurement operators are Mj = IT. An advantage of using observables is that the 
expected value of the measurement, denoted Em, takes a very simple form: 

Em(p) = J2 A * Pr (outcome i\p) = \Tr(U iP U\) = A i Tr(II i/ o) = Tr(Mp). (1.45) 



Finally, note that the framework above for general measurements II = {Mj} allows 
one to determine both the probability of outcome i, as well as the output state of the 
measurement process once % is read. If we only care about the former, as is the case 
in situations where the quantum system is only to be measured once and subsequently 
discarded, then this formalism is often simplified by defining positive semidefinite E { : = 
MiMi with YL% E i = L We nence nave: 

Pr(outcome i\p) = Tr(MpM/) = Tr(M/Mp) = Tr(^p). (1.46) 
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The set {Ei} is called a Positive Operator- Valued Measure (POVM). An advantage of 
using POVMs, for example, is that since the POVM elements Ei are positive semidefinite, 
optimizations over the set of all POVMs can be handled via semidefinite programming 
techniques. 

1.4.3 Evolution of quantum states 

We now know how to describe a quantum state p G T>(X), as well how to model a mea- 
surement or observation of p. The next question we ask is: What kind of operations can 
we perform on p? For example, to a classical bit, we can apply a NOT gate to flip its 
value. What can we do to a gitbit? 

In the quantum setting, the set of valid operations on a closed (defined shortly) quantum 
system with state p G T>{X) is the set of unitary operators U G U(X). Specifically, U maps 
p to 

p> ■= u P U ] . (1.47) 

For example, for p G P(C 2 ), i.e. a single qubit, a frequently used set of unitary operators 
are the Pauli operators (where i := y/^l G C) 

*-(:;) *-(;^)- <-> 

Note, for example, that the Pauli X plays the role of a quantum NOT gate, i.e. A|0) = |1) 
andX|l) = |0). 

We said that unitary operations describe the evolution of closed quantum systems above 
- let us elaborate on this further. A closed quantum system is one which does not interact 
with its environment. Conversely, if a system is not closed, it is called open. In this latter 
case, the set of allowed operations strictly contains U(X), and is in fact the set of TPCP 
maps, which we henceforth refer to as admissible maps or operations. Despite this, there is 
a sense in which discussing unitary operations is without loss of generality — this is implied 
by the Stinespring representation of super-operators and specifically Equation (1.37), which 
states that any valid TPCP operation on a quantum system A can be simulated by moving 
to a larger joint system AB, evolving AB via a unitary operator, and subsequently tracing 
out part of AB. (We discuss joint systems AB further in Section 1.4.4.) 

For example, let us consider the process of performing a measurement on A. In order to 
measure or observe a quantum state in A, one introduces a measurement apparatus, which 
we think of as system B. To complete the actual measurement, B must interact with A, 
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implying A is an open system. Thus, if we look at A alone, the action of the measurement 
on A is not described by a unitary operator, but by a TPCP map. However, if we instead 
look at AB as a whole, this joint system is now closed, and hence its evolution is described 
by a unitary operator. 



Hamiltonians, and the connection to unitary operations. We said above that the 
evolution of a (closed) quantum system is described by a unitary operator. Although this 
is a great abstract description for mathematicians and computer scientists to work with, 
one should ask the question: Why unitary operations? The answer lies, not surprisingly, 
in physics. Here we define the notion of a Hamiltonian, which will play an important role 
in later chapters such as those involving Hamiltonian complexity. 

First, note that any unitary U G IA{X) can be written as U = exp(iH) for some 
H G H(X). This is easily seen by taking the spectral decomposition U = j \ipj)(i/}j\, 



sen uy ueuvuig me speunai ueuuiiijjusiiiuii u — / ■ t 

and observing that defining 

H = ^ 



H = YM1>i)Q>s\ (1-49) 



yields U = e tH (see the discussion on operator functions in Section 1.3). The operator H 
is called a Hamiltonian. 

Thus, corresponding to each U G U(X), there exists an if 6 %{X). Where does H 
then come from? It turns out that the time evolution of a closed quantum system \ip) 
according to H is given by the famous Schrodinger equation, 

i h d ^ = H\iP), (1.50) 

where h denotes Planck's constant (whose value is not of interest here). For a quantum 
system evolving from time t\ to t 2 , the solution to this equation is given by 

m 2 )) = exp (^-^h) m,)), (1.51) 



n 

from which we now see the connection to unitary operators directly. 

For this reason, Hamiltonians have been the object of intense study, and there is nowa- 
days an entire field devoted to Hamiltonian complexity (see Section 1.5). The eigenstates 
{|A)} of a Hamiltonian are referred to as its energy eigenstates, and the eigenvalue A cor- 
responding to | A) is the energy of state |A). The smallest eigenvalue A min of H is called 
the ground state energy, and \X-mm) the ground state of H. Determining the ground state 
energy of a given H is in general a very difficult problem, as we shall soon see in Section 1.5. 
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Before closing, we make two final remarks. First, there is another interpretation of the 
Hamiltonian versus unitary pictures of time evolution presented here which is of interest. 
The application of any fixed unitary U can be thought of as a discrete-time evolution, since 
by Equation (1.51) it corresponds to evolution by some fixed time t. In the Hamiltonian 
picture, however, for any fixed Hamiltonian H, one can in principle vary the time of 
evolution t as desired, resulting in a notion of continuous-time evolution. 

Finally, in our discussion here we have focused on time-independent Hamiltonians. 
More generally, one can also consider evolution under time- dependent Hamiltonians which 
are allowed to change with time. 



1.4.4 Composite quantum systems 

Thus far, we have discussed the basics of how to mathematically discuss single quantum 
systems. Suppose now we have two quantum systems A and B — how do we describe their 
joint state AB? It turns out that if A and B correspond to complex Euclidean spaces X 
and y, then the joint system AB corresponds to the space X ®y . In other words, if, for 
example, X = y = C 2 , then any p G V(X <g> y) defines a valid two-qubit quantum system. 

The simplest examples of two-party systems AB are given by product states, which for 
any given pA G V(X) and ps G T>(y), are given by pa <8> Pb- Such states are uncorrelated 
between systems A and B. For example, two classical bits in state 00 can be embedded 
in such a two-qubit quantum state as |0) <S> |0). For brevity, when discussing pure states, 
we simply denote this state as |0)|0) or |00). More generally, one can also consider joint 
states |0) G C 2 <g> C 2 such as 

|0 + ) = ^|OO> + -^|11>. (1.52) 

This state is referred to as a Bell state, and possesses a strong degree of quantum correla- 
tions between systems A and B known as quantum entanglement, as discussed further in 
Section 1.6. 

Given a description p of the state of a joint system AB, we now require a method for 
describing the marginal state on A (or B) alone. Specifically, given a composite system 
p G V(X <g) y), the reduced state pa on A (analogously, pb on B) is given by the partial 
trace operation described in Section 1.3. In other words, 

p A = Tr B (p). (1.53) 
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For example, Tyb(pa <8> Pb) is simply pa, and Trs(|0 + ) (0 + |) = 1/2. The partial trace 
is employed here as it is the unique function which correctly produces the measurement 
statistics for arbitrary observables M measured on A alone. 

We close by remarking that our description of two-party composite systems straightfor- 
wardly extends to multiple parties: For systems A\ through A n corresponding to complex 
Euclidean spaces X\ through X n , the corresponding joint space is given by <S>™ =1 

1.4.5 Quirks of quantum mechanics 

Marking a drastic departure from the classical setting, a fundamental result in quantum 
mechanics is that an unknown quantum state G X cannot be copied or cloned. This 
is called the No-Cloning Theorem [86, 258]. To give a brief flavor of why this holds, we 
demonstrate a simple proof from Nielsen and Chuang [ ] (Box 12.1) for the case regarding 
the non-existence of a unitary U G U(X ® X) achieving the mapping 

\^) x ®\s) x ^\^) x ®\^) x , (1.54) 

where \s) is some fixed starting state. For sake of contradiction, suppose such a U does 
exist. Then for vectors l^i),^), let 

\<t>x) ■= Wi> ® l*» = |V>1> ® l^i) (1-55) 
:= U(\ip 2 }®\s)) = \ip 2 )®\<ip 2 }. (1.56) 

Then, (4>i\(f>2) = (V'llV^) = ({ipi\ip2)) 2 ■ But the equation x = x 2 only has solutions and 
1, implying that for general and ^2); such a U cannot exist. We remark that using 
the Stinespring representation, this proof is easily adapted to show that even TPCP maps 
cannot clone non-orthogonal states [261]. 

1.5 Quantum computational complexity 

With the basics of linear algebra and quantum computing under our belts, we can now begin 
discussing the first central area this thesis studies: Computational complexity theory. This 
field aims to rigorously classify computational problems based on the inherent difficulty of 
solving them. Specifically, the central idea here is to ask: 

Given a set of resources S, such as a certain amount of space or time in which a 
computation is to run, what is the class of computational problems which can be solved? 
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This approach has led to an entire zoo of such complexity classes (literally, a zoo [12]), 
including the ubiquitous classes P and NP. In this section, we review the extension of some 
of these concepts to the quantum setting. This includes defining the standard quantum 
circuit model our work is based on, introducing relevant quantum complexity classes, and 
presenting an exposition of the quantum version of the Cook-Levin theorem [72, 179]. The 
content of this section is based partly on the excellent surveys of Aharonov and Naveh [: 
and Watrous [248] , as well as the text of Nielsen and Chuang [200] . We assume background 
knowledge of basic (classical) computational complexity; the interested reader is referred 
to the text of Arora and Barak for an introduction [27]. 

Notation and definitions specific to this section. Throughout our discussion, we 
encode all computational problems over the binary alphabet £ := {0, 1}. We say a function 
/ : X* i— > S* is polynomial-time computable if there exists a polynomial time deterministic 
Turing machine which, given any input x G £*, outputs f(x). A function / : N i— > N is 
called polynomially-bounded if there exists a polynomial-time deterministic Turing machine 
which, on any input x G N, outputs l" x K A language is a partitioning S* = v4 yes U A no 
such that Ay es H A no = 0, for the empty set. If, more generally, A yes U A no C £*, then we 
have a promise problem. In a promise problem, one assumes the input x satisfies x G A ycs 
or x G A QO ; if an algorithm solving this promise problem is given input x G" A yes U A no , 
we adopt the convention that the algorithm is allowed to err. We remark that promise 
problems are particularly natural in the quantum setting, as quantum computations are 
inherently probabilistic processes, and as such, some "margin of error" appears to be needed 
separating A yes from A no . This is clarified further when introducing our relevant quantum 
complexity classes. 

1.5.1 Quantum circuit model 

In Section 1.4.3, we discussed the general types of admissible operations on quantum 
systems. In the context of complexity theory, however, we require a formal model for 
specifying and analyzing such operations, for which we employ the standard quantum 
circuit model. To begin, suppose we have a quantum system consisting of n qubits, whose 
associated complex Euclidean space is X — (C 2 )® n . A quantum circuit can be thought of 
as a directed acyclic graph with n input nodes of in-degree zero and out-degree one (i.e. n 
sources), n output nodes of in-degree one and out-degree zero (i.e. n sinks), and a set of 
"intermediate" nodes or gates, each of which has matching in- and out-degree c for some 
c G 0(1) (where each gate can have a different value of c). Intuitively, the input (output) 
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nodes are the n input (output) qubits to the circuit, and the intermediate notes are unitary 
gates acting on 0(1) qubits. The edges of the graph correspond to wires in the circuit, the 
direction of which are indicative of the direction of data flow. 

For example, three common single-qubit unitary gates mentioned in Section 1.4.3 are 
the Pauli X, Y, and Z operators, which are specified in the circuit model as: 



\b) ~ 


X 


— |6© 1) 


\b) ~ 


Y 


— (-l) b i\b® 1) 


\b) ~ 


Z 


- (~l) b \b) 



1.57) 
1.58) 
1.59) 



Here, we assume b G {0, 1}; the action of each gate is extended to all single qubit states 
by linearity. The notation © denotes the XOR operation (i.e. addition modulo 2). On the 
left of each gate is the input qubit, and on the right is the output qubit. 

Two other single-qubit gates, whose importance is discussed shortly, are the Hadamard 
and T (also referred to as 7r/8) gates, defined below. 
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A ubiquitous two-qubit gate is the Controlled-NOT gate, shown below. 
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Finally, a measurement (in the computational basis) in this model is specified by the 
following. 

' ' [1-63) 



Universal gate sets. When it comes to quantifying the cost of a circuit, it seems a 
priori that we are in a bind: How do we quantify the cost of an arbitrary gate if there 
is a continuum of unitary gates to choose from? It would be preferable to have a fixed 
finite set of gates, each of which is assigned unit cost, and with which we could simulate 
all other gates. This would yield a rigorous framework in which to quantify the cost of 
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a circuit. Such a set of unitaries is called a universal set, and indeed exists: The set 
S := {H,T,CNOT} is universal. To show this (see, e.g., [ 0]), one first demonstrates 
that the CNOT coupled with the set of all one-qubit unitaries is universal in an exact sense 
- any unitary U G U(X) can be represented exactly using CNOT and single-qubit gates. 
One then applies the Solovay-Kitaev theorem [173], which yields that for any U G W(C 2 ) 
and any e > 0, there exists a V G W(C 2 ) consisting of the composition of 0(log c (l/e)) 
gates from {H,T} such that || U — V < e (here, c G 0(1)). 

What does such a bound on the spectral norm buy us? Suppose we can substitute 
the original unitaries U = U m ---Ui in a circuit with unitaries V = V m ---V\ with the 
promise that || Ui — Vi < e for all i G [m] and for e to be chosen as needed. Since we are 
typically interested in running U on some input | ■?/>), followed by a measurement according 
to some POVM, we would like the probability of obtaining any measurement outcome to 
deviate by at most S when substituting V for U, where 5 > can be chosen as desired. In 
other words, for all POVM elements M, pure states and error parameters 5 > 0, we 
would like that setting e small enough yields that the probability of obtaining outcome M 
when measuring U\ip) versus V\ip) differs by at most 5. Indeed, this is achieved by setting 
e = 5 /(2m) and combining the facts that 

\Tr(MU\ij))(il)\tf) - Tt(MV\^)(^)\V^)\ < 2 || U - V , (1.64) 

and 

m 

|| U m ■ ■ ■ Ut - V m ■ ■ ■ V x IL < J2 II U s - Vj |L . (1.65) 

3=1 

We refer the reader to [ !00] for further details. 

We close this section by remarking that here we have assumed that quantum circuits 
are unitary and act on pure state inputs G X; recall from Section 1.4.3 that by the 
Stinespring representation and Equation (1.37), this is without loss of generality. We refer 
the reader to the work of Aharonov, Kitaev, and Nisan [ ! ] for a more general model of 
quantum circuits which directly operates on mixed states, and which explicitly harnesses 
this connection with the Stinespring representation. 

Oracles. A commonly used construct in the setting of quantum circuits is that of an 
oracle. An oracle Q n (where we more precisely deal with a family of oracles {Q n }) can be 
thought of as a black-box unitary operation encoding some predicate / : S n i— > S. In the 
quantum circuit model, this is formalized via the action 

Qn\x)\y) = \x)\y@f(x)), (1.66) 



25 



for i6E™ and y G S. Each such application of Q n is called a query to the oracle, and we 
typically think of each query as having unit cost. 

Suppose now that we wish to compute some property P of the predicate /; the number of 
queries to Q n required to do so is called the query complexity of P (relative to Q n ). Perhaps 
the most well-known example of this in the quantum setting is Grover's algorithm [113], 
which shows how to compute the OR function Vt=i /(*) with high probability using 0(\/2") 
queries to Q n , a quadratic improvement over the classical setting. Although the query 
model may a priori seem restricted, the model is nevertheless important; Shor conceived 
his factoring algorithm [221], for example, by studying Simon's algorithm [225, 226] from 
the quantum query model. 

1.5.2 Standard quantum complexity classes: BQP and QMA 

Recall that in complexity theory, we classify computational problems into complexity 
classes depending on the resources capable of solving them. The classes P and NP are two 
such classes, forming two cornerstones of classical complexity theory. We now discuss the 
natural quantum analogues of these classes, BQP and QMA. (More precisely, BQP and 
QMA are generalizations of BPP and MA.) For completeness, we recall the definitions of 
P and NP below. 

Definition 1.1 (P). A promise problem A = (A yes ,A no ) is in P if and only if there exists 
a deterministic polynomial-time Turing machine M which on input x G A jes , accepts, and 
on input x G A no , rejects. 

Definition 1.2 (NP). A promise problem A = (A ycs , A no ) is in NP if and only if there 
exists a deterministic polynomial-time Turing machine M and a polynomial p, such that 
on input x G S*; 

• If x G A yes , then there exists a proof y G S p ^^ such that M accepts (x,y). 

• If x G A no , then for all proofs y G £ p d x D ; M rejects (x, y). 

Now, since we have defined our complexity theoretic model for quantum computing 
based on the quantum circuit model, we next require the notion of a polynomial-time 
uniform family of quantum circuits. Specifically, since the length of input x G S* to a 
computational problem is allowed to vary, whereas the input size to a given circuit is fixed, 
we require a method for "scaling" our circuits up to match the length of arbitrary input 
x G £*. 
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Definition 1.3 (Polynomial-time uniform family of quantum circuits). A set of quantum 
circuits {Q n } is polynomial-time uniform if there exists a polynomial-time deterministic 
Turing machine, which on input l n , outputs a description of Q n . 

We now define BQP [46], which stands for Bounded-Error Quantum Polynomial Time, 
and which is intuitively the set of promise problems which can be efficiently solved with 
high probability on a quantum computer. For both BQP and QMA, we henceforth say a 
quantum circuit Q accepts input x (where x can be either a classical string or quantum 
state) if running Q on input x and subsequently measuring a designated output qubit of 
Q in the computational basis yields outcome 1. 

Definition 1.4 (BQP). A promise problem A = (A yes , A no ) is in BQP if and only if there 
exists a polynomial q and a polynomial-time uniform family of quantum circuits {Q n }, 
where Q n takes as input a string x G £* with \x\ = n, and q(n) ancilla qubits in state 
|O) 09(n) , such that: 

• (Completeness) If x G A yes , then Q n accepts input x with probability at least 2/3. 

• (Soundness) If x G A no , then Q n accepts input x with probability at most 1/3. 

Note that if we replace the uniform quantum circuit family above with a uniform 
classical circuit family which takes as input both x and a polynomial-size string y cho- 
sen uniformly at random, then we are reduced to BPP. Like BPP, the completeness and 
soundness parameters 2/3 and 1/3 above can straightforwardly be amplified to values ex- 
ponentially close to 1 and simply by running the verification procedure Q independently 
polynomially many times in parallel, accepting if and only if the majority of runs accepted, 
and applying the Chernoff bound. We remark that BPP C BQP follows since probabilistic 
classical computations can be simulated with quantum circuits (see, e.g. [248]). The deci- 
sion versions of the factoring and discrete logarithm problems are, for example, not known 
to be in BPP, but are in BQP due to Shor's algorithm [224]. 

We next define QMA, or Quantum Merlin Arthur, a quantum generalization of NP. 

Definition 1.5 (QMA). A promise problem A = (A yes , A no ) is in QMA if and only if there 
exist polynomials p, q and a polynomial-time uniform family of quantum circuits {Q n }, 
where Q n takes as input a string x G £* with \x\ = n, a quantum proof \y) G (C 2 ) (X,p(n \ 
and q{n) ancilla qubits in state \fy® q( ^ n \ such that: 

• (Completeness) If x G A ycs , then there exists a proof \y) G (C 2 )®^™) such that Q n 
accepts (x, \y)) with probability at least 2/3. 
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• (Soundness) If x G A no , then for all proofs \y) G {fC 2 )® p<yn \ Q n accepts (x, \y)) with 
probability at most 1/3. 

It is often helpful to think of \y) above as a proof sent by an all-powerful but untrustworthy 
prover Merlin, who claims x G A yes , and to correspondingly interpret {Q n } as an honest 
but computationally bounded verifier Arthur, whose job it is to verify the correctness of 
Merlin's proof. We are not overly fond of the names Merlin and Arthur, and as such, 
prefer to simply refer to both parties in this interpretation as being the prover and verifier, 
respectively. As an aside, we remark that QMA was originally known as Bounded Error 
Quantum NP (BQNP) [171]. 

Note now that if we instead ask in the definition of QMA that y G then the 

corresponding complexity class is known as quantum-classical Merlin- Arthur (QCMA) [22, 
156, 6, 11, 39, 18, 257]. (QCMA is also known by the name Merlin-Quantum- Arthur 
(MQA), as suggested by Watrous [248].) Finally, if y is classical and we replace {Q n } 
with a classical circuit family of the type used in defining BPP, then the class we obtain is 
Merlin- Arthur (MA) [33]. 

Error reduction for QMA. Like BQP, the completeness and soundness parameters in 
the definition of QMA can be amplified to values exponentially close to 1 and 0, respectively. 
However, the arguments employed here are not as straightforward as in the case of BQP. 
For QMA, there are two approaches for achieving error reduction, which we refer to as 
weak and strong error reduction, and which we now discuss. 

Weak or standard error reduction runs analogously to the case of BQP, i.e. by running 
the verification protocol some number of times m in parallel and taking a majority vote. 
However, since from Section 1.4.5, we know that unknown quantum states cannot be 
cloned, the verifier must ask the prover for multiple copies of the proof \y), one for each 
of the m parallel runs of the protocol. If the verifier is honest, the proof sent for the new 
protocol is a product state \y') = \y)® m G (C 2 )® p (l x 'D' m , in which case the m runs of the 
verification protocol are independently and identically distributed Bernoulli trials, and the 
Chernoff bound can be applied. However, if we have a NO-instance, i.e. x G A no , then in 
a desperate attempt to trick the verifier into thinking x G A yes , the prover may elect to 
cheat by sending a proof \y') which deviates from this product state structure. Can we 
still apply the Chernoff bound argument here? 

It turns out the answer is yes, the intuition for which we now sketch. (A detailed proof 
can be found in [22].) Specifically, let V denote the original verification protocol. Then, 
given any \y') G (C 2 )®p(N)- m ; we adopt the following view: On the first p(|x|) proof qubits, 
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we run the first copy of V, measure and read the output qubit, and subsequently discard 
these p(\x\) qubits. Note that the reduced state of \y') on these first p(\x\) qubits before 
running V is simply a convex mixture of proofs \y) G (C 2 )® p ^ x ^; thus, by the soundness 
property of the QMA protocol, the probability of acceptance in this first run is at most 
1/3. We can iterate this argument over each of the remaining m — 1 copies of V, each 
time obtaining a probability of accepting of at most 1/3. It follows that a majority vote, 
coupled with the Chernoff bound, yields the desired error reduction. 

Finally, although weak error reduction is simple, its disadvantage is that it requires 
an increase in the proof size, since the prover must send multiple copies of the original 
proof. Is it possible to reduce the error without increasing the proof length? Remarkably, 
Marriot and Watrous have shown [191] that the answer is yes. The rough idea here is best 
illustrated in the case of a zero-error verifier V, i.e. where the completeness and soundness 
parameters are 1 and 0, respectively. Specifically, let V be a zero-error verifier V, and \y) 
the prover's proof for some instance x G A yes . Then, if we run V on (x, \y)) and measure the 
output qubit, we will see outcome 1 with certainty. Thus, the measurement does not alter 
the output state of V. Further, if we now run V in reverse and measure the ancillary qubits 
of V, they should read all zeroes with certainty, implying this second measurement also 
does not alter the state being measured. In fact, we can repeat this back and forth process 
as many times as we like, each time obtaining the same "good" measurement outcomes. 

What happens now if we do not have a zero-error QMA verifier V, and have a NO 
instance x G A no l In this case, the output qubit of V\x) £g> \y) must yield outcome 1 
with probability at most 1/3 — in other words, measuring this qubit now disturbs the 
state V\x) ® \y). Moreover, when we next apply and measure the ancilla qubits, since 
V is unitary, the outcome cannot be the all-zeroes string with non-negligible probability, 
again disturbing the state. Intuitively, by repeating this back-and-forth procedure, we thus 
quickly amplify the likelihood of obtaining "bad" measurement outcomes in this process. 
In our opinion, the entire process can be thought of as analogous to a spinning top — if 
the top wobbles badly enough to begin with (if x G A no ), the spinning motion (the back 
and forth measurement process) quickly sends the top out of control. 



1.5.3 BQP and QMA in further depth 

As QMA plays an important role in this thesis, we now further discuss its properties, 
variants, and complete problems. Along the way, we also mention some further properties 
of BQP. 
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First, we have 

NPCMAC QCMA C QMA C PP. (1.67) 

Here, PP is defined analogously to BPP, except that when input x G ^4 yes , then the verifier 
accepts with probability strictly larger then 1/2; if x G A no , the verifier accepts with 
probability at most 1/2. The second of the containments above follows since a QCMA 
verifier can choose to act classically. The third containment holds since a QMA verifier 
can force a given quantum proof to encode a classical string by preceding the verification 
procedure with a measurement in the computational basis. Finally, the fourth containment 
has an elegant proof via the strong error reduction technique of Marriott and Watrous [191], 
and was originally proven by Kitaev and Watrous [172]. 

Regarding BQP, we have that 

BPP C BQP C QMA, (1.68) 

where the second containment follows since the verifier can simple flush the prover's proof 
down the toilet and run the BQP circuit instead. Combining Equations (1.67) and (1.68) 
yields BQP C PP; we remark that this containment was directly proven by Adleman, 
DeMarrais, and Huang [14] and Fortnow and Rogers [100]. Marriott and Watrous have 
shown that BQP = QMA log [191], where QMA log is QMA with a logarithmic size proof. 
The classical version of this equality might be written P = NPi og , i.e. NP with logarithmic 
size proofs is contained in P. Finally, it is well-known that in the classical setting, BPP C 
S2 [ 17, 177], for Eg the second level of the polynomial hierarchy PH. Whether BQP C PH, 
however, remains a major open question [8, 93, 9]. 

One-sided error. Next, we discuss the one-sided error versions of MA, QCMA, and 
QMA. Specifically, let MA 1; QCMA 1; and QMA X be defined as MA, QCMA, and QMA, 
respectively, except with completeness 1 in each case. In other words, if x G A yes , the 
verifier for the new classes accepts with certainty. Zachos and Furer have shown that MA = 
MAi [262] (see also Goldreich and Zuckerman [116]), and more recently, Jordan, Kobayashi, 
Nagaj, and Nishimura have proven that QCMA X = QCMA [159]. Whether QMA X = QMA, 
however, remains an interesting open question, particularly since both QCMA and QIP(3) 
in the chain QCMA C QMA C QIP(3) allow one-sided error [172]. Here, QlP(jfe) is the 
class of promise problems having Quantum Interactive Proofs with k rounds, meaning it 
is a generalized version of QMA in which k quantum messages are passed back and forth 
between prover and verifier. For example, BQP = QIP(O), QMA = QIP(l), and QIP(3) 
consists of a message from prover to verifier, followed by a message from verifier to prover, 
and a final message back from the prover to the verifier. Aaronson has demonstrated a 
quantum oracle relative to which QCMA X C QCMA and QMA X C QMA [7]. 
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Complete problems. We now move to arguably one of the most important questions for 
any complexity class: What problems characterize, or are complete for QMA ? In general, 
the set of QMA-complete problems is not yet nearly as rich as that for its classical cousin, 
NP. The historically first QMA-complete problem was the local Hamiltonian problem (first 
presented by Kitaev at [170], and later written up in [171]), which is a natural generalization 
of the NP-complete problem of classical constraint satisfaction, and relevant from a physics 
perspective. In fact, we devote Section 1.5.4 entirely to this problem and its variants, and 
thus do not discuss it further here. 

Perhaps the second-most studied and natural QMA-complete problem is the Consis- 
tency problem for local density matrices of Liu [182]. In this problem, one is given a 
classical description of a set of density matrices p$, each acting on a subset S C [n] qubits 
for 1 5" | = k and k G 0(1). The question is whether there exists a globally consistent n-qubit 
state p such that Ti^\ s (p) = p$ for all S. The proof of QMA-hardness for k = 2 follows 
via a polynomial-time Turing or Cook reduction involving convex programming from the 
2- local Hamiltonian problem [182]; the reduction in the reverse direction was later given 
by Liu in [183], and goes via a strong theorem of alternatives in semidefinite programming. 
Other physically motivated variants of the Consistency problem have also been shown to be 
QMA-complete: The variant involving fermions, known as the N-representability problem, 
was shown QMA-complete by Liu, Christandl, and Verstraete [184], as well as its bosonic 
counterpart by Wei, Mosca, and Nayak [253]. 

What other QMA-complete problems are known? Given a classical description of 
a quantum circuit, the problem of determining whether it is "close" to the identity, 
known as the Identity Check problem, was shown QMA-complete by Janzing, Wocjan, 
and Beth [157]. Rosgen [215] has shown that a similar problem where one is asked whether 
a given quantum circuit is close to a linear isometry is QMA-complete. Finally, Beigi and 
Shor [ 10] have proposed a QMA-complete quantum generalization of the Clique problem, 
which asks: Given an (entanglement-breaking) channel $, do there exist k quantum states 
p which are distinguishable without error after passing through the channel? 

Multiple provers. QMA is a proof system with a single prover and verifier. A curiosity 
emerges when we ask the question: What happens to the power of the proof system if 
we introduce a second prover? In other words, what if there are two provers, Pi and 
P2, who send a joint proof of the form \ipp) <8> \ipo) to the verifier? Interestingly, unlike 
the classical setting where having two provers is trivially equivalent to having a single 
prover, in the quantum setting, the possibility of entanglement between the two proofs 
(entanglement is introduced in Section 1.6) makes this a non-trivial question. This class 
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is called QMA(2) [175]. Why should it be of any interest? Perhaps surprisingly, Blier 
and Tapp [48] have shown that all languages in NP have very short proofs in this model; 
specifically, it suffices for P 1 and P 2 to send proofs l^p,} and \ipp 2 ), respectively, consisting 
of just O(logra) qubits each. The reader is referred to Chapter 4 for formal definitions and 
details regarding this model, where it is studied in further depth. 

1.5.4 Local Hamiltonian complexity: An overview 

In Section 1.5.3, we initiated our discussion of QMA-complete problems, and stated that 
the first known such problem was the local Hamiltonian problem. As this problem features 
heavily in Chapters 2 and 3, we now discuss it in further depth. We begin by defining 
the problem, and follow by demonstrating how it generalizes the canonical NP-complete 
problem MAX-SAT. We then discuss some of its variants and its history with respect to the 
field of complexity theory. Later in Section 1.5.5, we give Kitaev's proof that the 5-local 
Hamiltonian problem is QMA-complete. 

Beginning with definitions, the local Hamiltonian problem (LH) was introduced by 
Alexei Kitaev [170, 171], and can intuitively be thought of as follows: Given a "succint" 
representation of a "large" Hamiltonian H, what is H's smallest eigenvalue? Of course, 
the obvious approach to answering this question is to diagonalize H — however, the catch 
is that while H is a 2 n x 2 n -dimensional matrix, the succinct encoding we are given of 
H consists of poly(n) bits. In other words, a simple diagonalization approach would take 
time exponential in the input size. 

Let us now define LH more formally. To do so, we first define the term k-local Hamil- 
tonian. 

Definition 1.6. An operator H e l-L(B® n ) is called a k-local Hamiltonian if it can be 
written 

r 

H = J2 H i> (1-69) 

where {Hj} r . =1 C l-l{B® k ) is a collection of local Hamiltonian terms, such that each Hj acts 
non-trivially on some subset Sj C [n] of at most k qubits and satisfies ^ Hj I ■ Note: 
In Equation (1.69), we adopt the convention that each Hj acts as the identity on all qubits 
in the set [n]\Sj. 

Note that although we define H as acting on qubits above, the definition extends straight- 
forwardly to the case of higher-dimensional local systems. Intuitively, the definition above 
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says that a fc-local Hamiltonian H can be expressed as a sum of "smaller" Hermitian 
operators Hj, each of which is restricted to act non-trivially on at most k out of n qubits. 

We now phrase the k-LH problem. We remark that later, in Chapter 2, we shall 
formulate k-LH in a slightly different manner; the definition below is, however, arguably 
more natural and thus better suited to an introductory section. 

Problem 1.7 (/c-Local Hamiltonian (k-LH) [171]). Given as input: 

1. A k-local Hamiltonian H acting on n qubits, specified as a collection of local Hamilto- 
nian terms {HjY- =l C *H(13® k ) (i.e. as a collection of (2 k x 2 k )- dimensional matrices 
Hj) where k G 0(1), 

2. Threshold parameters a, b G M., such that < a < b and (b — a) > 1, 

decide, with respect to the complexity measure (H) + (a) + (b) : 

1. If X m m(H) < a, output YES. 

2. I/A min (#) > b > Output NO. 

Note that often k-LH is phrased with (b — a) > l/p(n) for some polynomial p; such 
an inverse polynomial gap can straightforwardly be boosted to the constant 1 above by 
defining H to have p(n) many copies of each local term Hj [248]. 

Although it may not be a priori obvious, k-LH generalizes the canonical NP-complete 
problem MAX-k-CSP, where CSP stands for Constraint Satisfaction Problem (of which a 
special case is the more familiar problem MAX-k-SAT). To see this, recall that in MAX- 
k-CSP, one is given a set of Boolean functions, q : {0, l} fc i-> {0, 1} (note the q are not 
restricted to be of any particular form such as conjunctive normal form), where each Cj 
acts on k out of n possible bits. We then ask: What is the largest number of clauses q we 
can satisfy with a Boolean assignment to the n bits? To embed this problem into k-LH, we 
design a fc-local Hamiltonian H acting on n qubits as follows. For each clause q, define a 
2 k x 2 fc -dimensional diagonal matrix H Ci 6 H,(B® k ) such that H Ci (m,m) = if the binary 
representation of m is a satisfying assignment for clause q; otherwise, H c .(m,m) = 1. In 
other words, for x G {0, l} fc , Tr(if c . |x)(x|) = if x satisfies Cj, and Tr(H c . \x) (x\) = 1 
otherwise, i.e. failing assignments are given an energy penalty. To now see that the optimal 
value of our MAX-/c-CSP instance corresponds to the smallest eigenvalue of H = ^2 c H Ci , 
we use the fact that since all the H Ci are diagonal, they commute and thus simultaneously 
diagonalize. Hence, H has integer eigenvalues. Moreover, since the H Ci are simultaneously 
diagonal in the computational basis, the smallest eigenvalue of H equals the minimum 
number of unsatisfied clauses over all n-qubit computational basis states. It follows that 
k-LH generalizes MAX-k-CSP, and thus k-LH is NP-hard. This raises the natural question: 
Could k-LH be a canonical QMA-complete quantum constraint satisfaction problem? 
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Variants of k-LH and a brief history. It turns out that k-LH is indeed QMA-complete; 
Kitaev [170, 171] showed the problem to be in QMA for k > 1 and QMA-hard for k > 5. 
The proof of QMA-hardness was inspired by earlier ideas of Feynman [171, 97], and can 
be thought of as exploiting Feynman's ideas to adapt the classical Cook-Levin theorem in 
a non-trivial fashion to the quantum setting. The fact that 3-LH is also QMA-complete 
was shown subsequently by Kempe and Regev [164] (an alternate proof was later also 
given by Nagaj and Mozes [19! ]). Finally, Kempe, Kitaev, and Regev showed [163] that 
even 2-LH is QMA-complete. Note that 1-LH is in P, since one can simply optimize for 
each 1-local term independently. Although these results are interesting from a complexity 
theoretic perspective, a more natural question from a physics perspective is whether such 
QMA-hardness results can be shown even if the QMA-hard classes of local Hamiltonians 
arising in the reductions employed correspond to physical quantum systems in nature [1 , 
20, 198, 223]. Along these lines, Oliveira and Terhal next showed [202] that 2-LH with 
the Hamiltonians restricted to nearest-neighbor interactions on a 2D grid is still QMA- 
complete. Furthermore, in stark contrast to the classical case of MAX-2-CSP on the line 
(which is in P), Aharanov, Gottesman, Irani and Kempe [ ] showed that 2-LH with 
nearest-neighbor interactions on the line is also QMA-complete if the local systems have 
dimension at least 12 (Nagaj later improved this to 11 states per particle [198]). 

Although this thesis focuses on the general local Hamiltonian problem as defined in 
Definition 1.7, for completeness, we now mention a few interesting variants of LH which 
have also been studied. First, Bravyi and Vyalyi showed that the variant of 2-LH (with 
local systems of arbitrary, but constant, dimension) in which all local Hamiltonian terms 
Hj pairwise commute is in NP. This result was extended to the case of 3-LH on qubits by 
Aharonov and Eldar [19]. Bravyi [55] introduced a variant of k-LH known as Quantum k- 
SAT, in which each local Hamiltonian term Hj is a projector, and in which the threshold a 
is set to 0. We remark that in the YES case of such a setup, the local Hamiltonian is referred 
to as frustration-free, since the optimal assignment lies in the null space of every interaction 
term. Bravyi then showed that, like classical 2-SAT, Quantum 2-SAT is in P (whereas 
recall 2-LH is QMA-complete) [55]. In contrast, Quantum 4-SAT is QMA-complete (recall 
QMA X is the one-sided error analog of QMA) [55] . Whether Quantum 3-SAT on qubits 
is QMAj-complete remains an intriguing open question (see Reference [199]). Next, there 
has been a line of work on so-called stoquastic local Hamiltonians [56, 58, 59, 183, 158]. 
Specifically, the Stoquastic fc-SAT problem, defined the same as Quantum /c-SAT except 
that all local projectors have real non-negative matrix elements when expressed in the 
computational basis, was shown to be in MA for k > 1, and MA-complete for k > 6 [56, 59]. 
(Incidentally, this was the first non-trivial example of an MA-complete promise problem.) 
The problem Stoquastic LH-MIN, defined as k-LH except where each local Hamiltonian 
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constraint Hj has real non-positive off-diagonal matrix elements in the computational basis, 
was shown complete for the class StoqMA [ ] for k > 2. Here, StoqMA is a variant of 
QMA in which the verifier is restricted to preparing qubits in the states |0) and |+), 
performing classical reversible gates, and measuring in the Hadamard (i.e. |+), |— )) basis. 
Note that MA C StoqMA C QMA. Finally, variations of LH with symmetry constraints 
have been studied from a complexity theoretic perspective in, for example, [117, 161]. 

Connection to physics. Although we have primarily discussed LH from a complexity 
theoretic viewpoint involving quantum constraint satisfaction, the initial motivation for 
studying LH comes of course from physics. Indeed, the study of the local Hamiltonian 
problem is part of the more general field of Hamiltonian Complexity, whose aim is to 
understand how difficult it is to simulate physical systems. In particular, LH can be 
phrased as a special case of the more general Simulation Problem [20 1] , which roughly asks 
the following: Given a description of a Hamiltonian H, an initial state p, an observable 
M, and a time ieC, estimate the expectation 

tj,m\\ im - 

The local Hamiltonian problem is recovered by choosing H as a local Hamiltonian, setting 
M = H , p = I/Tt(I), and considering t — i/3 for G R and — > oo. We refer the reader 
to the survey of Osborne for further details [204]. 

1.5.5 Kitaev's quantum Cook-Levin theorem 

In Section 1.5.4, we discussed the local Hamiltonian problem (LH) and its variants. As 
Chapter 3 heavily exploits the structure and details of Kitaev's quantum version of the 
Cook-Levin theorem, i.e. his proof that 5-LH is QMA-complete, we present the latter here. 
This requires two steps: One first shows that k-LH G QMA for k > 1. One then shows 
that k-LH is QMA-hard for k > 5. Our discussion is based on a project completed by the 
present author for a graduate course on quantum complexity theory at the University of 
Waterloo [104], and follows the text of Kitaev, Shen, and Vyalyi [171] closely. The reader 
is referred to the survey of Aharonov and Naveh for an alternate exposition [22]. 

Local Hamiltonian is in QMA 



We begin by showing that k-LH G QMA for any constant k. Specifically, for any YES- 



instance (H, a, b) of k-LH with /c-local Hamiltonian H = X^=i Hj ^ £(£>® n ), we show that 
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there exists a poly-size quantum proof \ip) and a poly-size quantum verification circuit V, 
such that a single-qubit measurement on V\ip) yields 1 with high probability. 

First, the quantum proof is constructed as 6C r ® B® n (8) B as: 

M=^£|j))®fo)®|0), (1-71) 

for {|j)}j =1 an orthonormal basis for C r , and \rj) an eigenvector corresponding to some 
eigenvalue A of H. We call the first register of \ip) the index register, the second the proof 
register, and the last the answer register. 

To define the verification procedure V, recall that H = X^=i-^j> where each Hj 
acts on the set of qubits denoted by Sj. Suppose Hj has spectral decomposition Hj = 
^2 S K\K)(K\- Then, define unitary Wj acting on the proof and answer registers, i.e. 
Wj eU{B n ®B), such that 

Wj (|A S ) ® |0» = |A S ) ® (a/aTIO) + Vl-A.ll)) . (1.72) 

Observe that one can implement this operation as follows. First, run phase estimation on 
exp(iHj) to extract A s to some ancilla register. Despite the fact that simulating exp(iHj) 
can in general be costly, in our case, since \Sj\ is constant, the simulation can be done 
efficiently Conditioned on the value of the ancilla, we then rotate the answer register 
to obtain the desired superposition, and finally uncompute X s in the ancilla. Define now 
unitary V := £J =1 \j)(j\®W r 

Having defined \ip) and V, the verification procedure now proceeds as follows: 

1. Apply V to 

2. Measure the answer register and return the result. 

Let us analyze the probability of measuring 1 in the answer register with this procedure. 
If we assume the index register is implicitly measured at the end of the verification, then 
we can think of Step 1 above as using the index register to choose an index j uniformly 
at random, followed by applying Wj to the proof register. Then, we can analyze the 
probability that this procedure returns 1 as follows: 

r 1 

Pr(output 1) = ~ Pr(output 1 | Wj is applied), (1.73) 
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where one has 

Pr(output 1 | Wj is applied) = Tr \(I B m <g> \l)(l\)Wj{\r]}{r]\ <8> |0)(0|)Wjl (1.74) 

= «r7| <8) <0|)1^;(J^ (g> Il><l|)V^(|77> <S> |0». (1.75) 

The projector |1)(1| above acts on the answer register. To simplify this, rewrite \rj) in the 
eigenbasis of Hj, i.e. \rj) = ^ s « s | A s ) , and observe that 

(/ B ®n <g> (l\)Wj(\rj) <g> |0)) = (Jb®« <g> (l|)Wj ( ^ a s |A s ) ® |0) J (1.76) 



(I B ®»®(1|) ^a s |A s ) ® (s/KIO) + v 7 ! - Al|l> 
= ^a s (Vl-A s ) |A S ). (1.77) 

s 

Substituting this into Equation (1.75), we obtain: 

Pr(output 1 | Wj) = (y^at (v 7 ! - A) (A t |J fca, (0 - K) 

= ^(1-A s )|a s | 2 (1.78) 

s 

= 1-^A S |« S | 2 (1.79) 



.s 



= l-fal^M, (1-80) 

where we have used the fact that J2 S \ a sf = 1- Substituting this into Equation (1.73) 
finally yields: 

Pr(output 1) = J2 1 1- ( V \H 3 \ V ) = 1 - 1 ( V \ (j^Hj) \v) = l~ 1 W\V)- (1-81) 

Recalling that we chose r\ to be an eigenvector of H with some eigenvalue A, we have that 
if H corresponds to a YES instance (i.e. there exists A < a), it follows that we can choose 
77 such that our verification procedure returns 1 with probability 1 — r~ l \ > 1 — r~ l a. On 
the other hand, if H corresponds to a NO instance (i.e. for all A, we have A > b), we have 
Pr(output 1) < 1 — r _1 &. Since the probabilities in the YES and NO cases differ by an 
inverse polynomial in the input size, we can apply the error reduction techniques for QMA 
discussed in Section 1.5.2 to conclude that LH £ QMA. 
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5-local Hamiltonian is hard for QMA 



We next show that 5-local Hamiltonian is QMA-hard. To do so, we show a polynomial-time 
many-one or Karp reduction from an arbitrary problem in QMA to 5-LH. 

To begin, let P be a promise problem in QMA, and let V = VlVl-i . . . V\ be a verifi- 
cation circuit for P composed of unitaries Vk- Without loss of generality, we assume each 
\4 acts on pairs of qubits. We assume V G U{B® m <S> B® N ~ m ), where the m-qubit register 
contains the proof V verifies, and the remaining qubits are ancilla qubits. 

Our goal is to define a 5-local Hamiltonian H that will have a small eigenvalue if and 
only if there exists a proof G B® m causing V to accept with high probability. Kitaev's 
idea [171] was to exploit the structure of V by forcing the minimizing eigenvector of H 
to "simulate" the action of V. To do so, let H act on B® m ® B® N ~ m <g> C i+1 , which is 
simply the initial space V acts on, tensored with an (L + l)-dimensional counter or clock 
register. This clock register will "keep track of time" in the simulation, i.e. a value of k in 
the register will correspond to having "applied" V± ... For clarity of exposition, where 
necessary, we label the three registers H acts on as p for proof, a for ancilla, and c for 
clock, respectively. 

Having defined the space H acts on, we now define H itself: 

H := H in + ifprop + H out , (1-82) 

with the terms H in , H pTop , and H ont defined as follows (intuitive explanations to follow). 
Let 

H m := I p ® (4 - |0 . . . 0) (0 . . . 0|J ® |0> (0| c . (1.83) 

Note that the projector (I a — |0 . . . 0)(0 . . . 0| a ) is used here for simplicity of exposition; the 
same analysis holds if we instead use the 1-local constraint ^j = ^ m (|l)(l|i)c (where the ith 
projector acts on the ith ancilla qubit) — hence, we do not violate the constraint that H 
be 5-local. Next, H ont is defined as 

#out := (|0>(0| ® 7^™-i) p ® I a ® \L)(L\ c . (1.84) 

Finally, define H prop as 

L 



#pro P := y~]Hj, where 



i=i 



H 



3 ■= -\v 3 ®\3){3 ~l\ c -\v ] 3 ®\j - \){j\ c + (1.85) 
\l®{\3){j\ + \j-l)(3-l\) c . (1.86) 
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Each of the terms H m , H out} and H pmp allow us to "force" the minimizing eigenvector 
of H to "simulate" V as follows. Recall that our goal is to have (t]\H\t]) for some \r]) G 
E® m g> B® N ~ m g> C i+1 be small if and only if V outputs 1 with high probability on some 
proof \ip) G B® m . Suppose such a exists. Then, for H in , note that when one runs V on 
\ip), the initial state should be \i)) p ® \0)^ N ~ m , i.e. all ancilla qubits should be set to 0, with 
the purported proof in the proof register. But H m enforces precisely this constraint for any 
I77). In particular, if the clock register of \r]) is in state |0)(0| c and the ancilla register is not 
all zeroes, then we have (rj\Hi n \rj) > 0, i.e. (77) incurs an energy penalty. In other words, 
if | rj) does not simulate the initial state of the verification procedure V, H in penalizes \rj). 
Next, for H ont , note that after running V on we expect the first qubit in the proof 
register to be a 1 with high probability. Again, observe that H out enforces exactly this 
constraint on \rj) — if the clock register is in state \L)(L\ C and the first qubit reads 0, we 
again have {r]\H out \r]) > 0. Finally, H prop follows the same idea by forcing \rj) to encode in 
superposition a simulation of each step of the verification procedure V. It follows that the 
minimizing vector \rj) is of the following form, often called a history state: 

\V) ■= £ Wi ■ ■ ■ ® \0)T~ m ) ® \J) C - (1-87) 



To recap, if there exists a such that V accepts with high probability, then the history 
state 1 77) corresponds to a small eigenvalue of H. On the other hand, if no such 
exists, either \rj) will be of the form in Equation (1.87) (i.e. will faithfully simulate V), 
in which case we are hit with a large penalty by H out since the answer qubit cannot be 
1 with high probability, or "cheats" by deviating from either the initial conditions or 
the intermediate steps of the protocol, in which case the terms H m and H out hit |r/) with 
an energy penalty, respectively. Thus, the corresponding energy of \r)) would be large. Of 
course, it remains to show that this intuition is indeed correct! 

Before we begin, we first apply the following change of basis operator to H prop , which 
greatly simplifies the analysis (intuition to follow): 

L 

W = Y,Vj...V 1 ®\j){}\ e . (1.88) 

3=0 

Thus, instead of \rj) and H, we consider \rj) := W\rj) and H := W^HW . To see what H 
looks like, we analyze the action of W on each of H in , H out , and H pmp separately. Observe 
first that ifin := W^H in W = H in , since at time 0, W implicitly applies the identity to the 
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proof and ancilla registers. Second, for H out , we have 



H, 



out 



W^H mit W = 



(|0)(0| ® I B ® 



■ -i ) p ® 4 



7 (8) |L)(L| C = (yt g, I c )H out (V ® J c ), 

(1.89) 

since at time L, W applies the entire circuit V. Finally, for H prop , considering the effect of 
W on each component of Hj in Equation (1.86) separately and using simple algebra, one 
finds 

W^HjW = I p , a ®\{\3 - - - - l\ + \3){j\)c = h,a®\ ( 1 _1 



-1 1 



It follows that H prop = Ylj W^HjW is tridiagonal and of the form 



1.90) 



Hp r0 p — Ip ® la ® 



I -§ 

— I 1 — § 

-\ 1 -\ 

o o -I i -I 







V : 



=: I v ®l a ® E c 



J 



(1.91) 



where we have let E denote the tridiagonal matrix acting on the clock register for later 
reference. Intuitively, one can think of the change of basis W as "flushing out" the com- 
putation V, so that it is pushed to the very end to time step L (hence V only appears in 
H out ). This has the effect of simplifying H prop to a nice tridiagonal form, since it no longer 
needs to keep track of the unitaries Vi. 

Finally, observe that since W is unitary, H and H have precisely the same set of 
eigenvalues. We can thus work with H instead of H in our eigenvalue analysis. Hence, for 
the remainder of this section, by \rj) we shall mean \fj), and by H, we mean H. We now 
show that H has the correct spectral properties for both YES and NO instances of 5-LH. 



YES case: H has a small eigenvalue 

We have thus far set up a Hamiltonian H G T-L(B® m <8> B® N ~ m <8> C L+1 ) corresponding to 
the verification procedure V G U(B® m <g> S® N ~ m ). We now show that if there exists such 
a \if}) G B® m which causes V to output 1 with high probability, then H must have a small 
eigenvalue. 
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Suppose there exists such that a measurement of the first qubit of V\ip) yields 1 
with probability at least 1 — e. To demonstrate that 77 has a small eigenvalue, we explicitly 
construct a vector \rj) E B® m <g> B® N ~ m <g> C L+1 such that (rj\H\rj) is small. Let 



i^) = i^) P ®io)r" m ®i7) c , 



where 



|7> == 



\fL + 



(1.92) 



;i.93) 



We analyze (i]\H\i]) by considering H [n , 77 prop , and 77 out separately. First, observe that 
(r)\Hi n \r)) = 0, since the ancilla register of |?7) is in the all zeroes state. For 77 prop , we have 
that 

(77|tf pr op|77> = (v\I P ,a ® E c \r]) = ( 7 |£| 7 ) = 0, (1.94) 

where in the last equality we have used the fact that the sum of each row and column of 
E is 0, implying | 7 ) is a 0-eigenvector of E. Note that we have not used the probability of 
V answering 1 yet — this now comes in handy for 77 out , where 



(r)\H out \r)) = (|0)(0| (gi/^-i), 



V®\L){L\ e )\ V ) 



7 + 1 L 



0(O |^-ytl [(|0)(0|®7 e ^-0 p ' 



(1.95) 

^^) P ®io)r- m] 



Observe, however, that this expression corresponds to the probability that we begin with 
the proof \ip) p <S> \0)^ N ~ m , apply the verification V, and then measure the first qubit and 
obtain 0. By our assumption at the beginning of this section, this probability is at most e. 
Hence, 

(77|77 out |77> < j^e, (1.96) 

implying there must exist an eigenvalue for H of value at most ej (7 + 1). Thus, if we have 
a YES-instance of our QMA problem P, then H has a small eigenvalue, as required. 



NO case: H has no small eigenvalues 

We now show that if there does not exist such a proof \ip) which causes verification proce- 
dure V to output 1 with high probability, then H must have no small eigenvalues. 

Suppose that for all proofs V does not output 1 with probability more than e. 
To lower bound the eigenvalues of 77, we play a game of divide-and-conquer by letting 
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H = A% + A 2 , where A\ := H m + H out , and A 2 : = H pmp , and analyzing the eigenvalues 
of A\ and A 2 separately. The challenge arises in combining these separate eigenvalue 
estimates into eigenvalue estimates for H, since unfortunately [Ax, A%] 7^ 0, implying that 
Ax and A 2 do not diagonalize in a common basis. To surmount this obstacle, Kitaev uses 
the following approach [171]: 

1. We first prove Kitaev's Geometric Lemma (Lemma 1.8), which takes as input oper- 
ators B and C, as well as a set of parameters S dependent on B and C, and outputs 
a lower bound on the eigenvalues of B + C. 

2. We compute the parameters S relevant to our specific operators Ax and A 2 , and plug 
them into Lemma 1.8 to show that H = Ax + A 2 has no small eigenvalues. 

We now state and prove Kitaev's Geometric Lemma. 

Lemma 1.8 (Kitaev, Shen, Vyalyi [171], Geometric Lemma, Lemma 14.4). Let A 1} A 2 >z 

0, such that the minimum non-zero eigenvalue of both operators is lower bounded by v. 
Assume that the null spaces Cx and C 2 of Ax and A 2 , respectively, have trivial intersection, 

1. e. Cx H C 2 = {0}. Then 



where the angle a(X, y) between X and y is defined over unit vectors \x) and \y) as 
cos [Z(X, y)] := m&x\ x)eXi]y)E y | (x\y) \ . 

Note that if X and 3^ have non-trivial intersection, i.e. there exists \x) ^ such that 
\x) G X and \x) G y, then a(X, y) is trivially 0. Also, note that demanding X ny = {0} 
is not equivalent to demanding X and 3^ be orthogonal — for example, the spaces X = 
span{|0)} and y = span{|+)} contain elements which have non-zero overlap, but the sets 
have trivial intersection. 

We now tackle step 1 of Kitaev's approach by proving Lemma 1.8. 

Proof. By the definition of v, we have Ax t: v(I — H-d) and A 2 >z v(I — Il£ 2 ), where 
Tlx denotes the projector onto X . Combining the latter two, it follows that it suffices to 
show v(I — Hci) +v(I — l~lc 2 ) ^ 2f sin 2 (a(£i, C 2 )/2). By rearranging terms and using the 
identity cos(2#) = 1 — 2sin 2 #, this is the equivalent of showing 



Ax + A 2 y 2v sin 



2 a{Cx,C 2 ) 
2 



(1.97) 



n £l + n £2 ■<[! + cos(a(£i, £ 2 ))] /. 



(1.98) 
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To upper bound the eigenvalues of IL^ + Hc 2 , suppose we have some eigenvector \() 
with corresponding eigenvalue A > 0. Let \x\) G C\ and \x 2 ) G C 2 be unit vectors such 
that IIjCi IC) = u i\ x i) an d n^l^) = u 2 \x 2 ) for some real Wi,W2 > 0. Then: 

A = (C|(n A +U C2 )\0 =u 1 ((\x 1 ) + u 2 ((\x 2 ) = u\ + u\. (1.99) 

Further, since X\() = (Jlci +^-c 2 )\0 — u i\ x i) + u 2 \x 2 ), we can also derive a non-equivalent 
expression for A 2 , i.e. 

A 2 = [(C|A][A|C>] = (u 1 (x 1 \+U2(x 2 \)(u 1 \x 1 )+U2\x 2 )) = ul + u 2 2 + 2u 1 u 2 Re(x 1 \x 2 ), (1.100) 

where Re(x) denotes the real part of x G C. Combining Eqns. (1.99) and (1.100) by taking 
the following linear combination, we have: 

(1 + |Re(xi|x2)|)A — A 2 = (1 + |Re(xi|x2)|)(w 2 + u\) — (u\ + u\ + 2u\u 2 ~Re(xi\x 2 )) 

= ul\Re(xi\x 2 )\ + ul\Re(xi\x 2 )\ — 2uiu 2 Re(xi\x 2 ) (1.101) 

= |Re<aJi|x 2 >| (u\ + ul±2 Ul u 2 ) (1.102) 

= \Re(x l \x 2 )\{u l ±u 2 f (1.103) 

> 0. (1.104) 

Moving A 2 to the right side of the last inequality and dividing through by A hence gives 

A < (l + |Re<xi|ar 2 )|) < 1 + cos(a(£i, £ 2 )), (1-105) 

where the latter inequality follows straightforwardly from the definition of a(Li,C 2 ). We 
thus have that all eigenvalues of Il^+n,^ are upper bounded by l + cos(a(£i, C 2 )), which 
by Equation (1.98) implies the desired lower bound on Ai + A 2 . □ 



We now move to step 2 of Kitaev's approach, i.e. we now use Lemma 1.8 to lower bound 
the eigenvalues of H = A\ + A 2 . To do so, we must determine the values of parameters 
v and a(£i,£ 2 ) for A\ and A 2 used in Lemma 1.8. Recall that Lemma 1.8 also requires 
C\ PI C 2 = {0} — we handle this constraint at the end of the section (at which point it will 
be obvious, given the analysis to come). 

We start with v, which is the lower bound on the positive eigenvalues of both A\ and A 2 . 
Note that since A\ = H in + H out is simply a sum of commuting projectors, its eigenvalues 
must be non- negative integers. In particular, its smallest positive eigenvalue is at least 1. 
For A 2 , since A 2 = H pTop = I Pta ®E C) its eigenvalues will be determined by those of E c . The 
eigenvalues of the latter are [171] A^ = 1 — cos[7rfc/(L + 1)] for < k < L. This expression 
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is clearly minimized when k = 1 (note k = would yield a zero eigenvalue), implying the 
smallest positive eigenvalue of A 2 is at least 

1 - cos(tt/(L + 1)) > c/L 2 (1.106) 

for some constant c. To see why this inequality holds, use the Taylor series expansion for 
cosx to show that whenever x < 1, one has 

■X.' iX' tX,' tX,' lX' I J- J. \ O O* v 

cosx = l-- + --- + ...<l-- + -<l-^---jx 2 = l-cx 2 . (1.107) 

Taking the minimum of our lower bounds for Ai and A 2 thus yields that v G Q(l/L 2 ). 

We next estimate the angle a(Ci, C 2 ) between the null spaces C\ and £ 2 of A\ and yl 2 , 
respectively. This can be done by exploiting the structure of Ci and £ 2 - In particular, we 
have that 

C x = [(^ m ) p ® |0)^" m ® |0) c 

[(B® JV ) p , a ®span(|l),...| J L-l)) c ] © 

[v+di)®^- 1 )^®!^, (i- 108 ) 

where each of the three terms in this expression follow directly from the definitions of H in 
and /T ou t (e.g. any state with the clock register set to and all zeroes in the ancilla is a 
0-eigenvector of both H in and H ont ). Similarly, we have 

A = (i3 0iV ) p , a ®|7) c , (1.109) 

which follows straightforwardly if we recall that H prop = l va ® E c and E\j) = 0, for I7) 
defined in Equation (1.93). 

To exploit this structure, instead of estimating a(£i,£ 2 ), we estimate cos 2 a(£i, £ 2 ), 
which can be rewritten in the form (where the maximization is over unit vectors): 

cos 2 a(£i,£ 2 ) = m&x lx)eCl ,\ y )ec 2 \ {x\y}\ 2 = max lx)eCl ,\ y )ec 2 (y\x)(x\y) = max ly)eC2 (y\U Cl \y). 

(1-110) 

The last equality holds without loss of generality since the maximum for (ylUc^y) is 
achieved by projecting onto a pure state \x)(x\ for some \x) G C\. Let us upper bound the 
rightmost term in the equation above. Observe that by Equation (1.109), any \y) G £ 2 has 
the form \y) = \0 p ^ ® |7) c for some |C) G E® m ® B m ~ m . Since by Equation (1.108), U Cl 
breaks down into a sum of three projections, we can bound (ylUc^y) by determining the 
contribution of each projector separately when sandwiched by \y). 
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The contribution of the second projection is easiest to see — it is simply (L — 1)/(L+1), 
since every term in I7) except |0) and \L) contribute 1/(L + 1) to the sum. 

As for the first and third projections, let K x = B® m ®\0) m ~ m and K 2 = V^\l) giB®*" 1 . 
Then the contribution of the first and third projections is given by: 

(vWto ® |o)(o| c + u K2 ® \L)(L\ c )\ y ) = ^(CKn^ + n^)|c>. (1.111) 

If we let <^(/Ci,/C 2 ) denote the angle between JCi and /C 2 , we can straightforwardly use 
Equation (1.98) to bound the quantity above by 

' (C|(n Kl + n^)|C) < (l + cos^/d./Ca)). (1.112) 



L + l 

Observe, however, that 



cos 2 p(£i,/C 2 ) = max\ k)eKlt \i) eK2 \(k\l)\ 2 , (1.113) 



where K\ is just the set of initial states with all-zero ancilla for the verification procedure 
V, and /C 2 is the set of initial states for which applying V yields a 1 in the first qubit with 
certainty. Hence, the maximum overlap between vectors in K,\ and /C 2 is directly tied to 
the maximum probability with which we can obtain outcome 1 with an initial state with 
all-zero ancilla. In particular, we have cos 2 <p()Ci, /C 2 ) equals the maximum probability of 
outputting 1. Since in this section we are dealing with the NO case, however, meaning no 
proof can cause an output of 1 with probability greater than e, we have cos 2 y(/Ci, /C 2 ) < e, 
implying: 

(1 + cos ¥>(£!, /C 2 )) < (1 + y/e). (1.114) 

Adding the contributions of all three projections thus yields: 

cos 2 a(£ 1 ,£ 2 )=max| J/>e£2 ( 2/ |n £l |y)< (j^j + (t^t) = 1 ~ TTT" (L115) 

Using the identity sin 2 x + cos 2 x — 1, this implies sin 2 a(£i, £ 2 ) > (1 — y/e) /(L + l). Then, 
since sin 2 | > | sin 2 x (shown using the identity sin(2x) = 2 sin x cos x), we have 

sin 2 %^>^inMA,£ 2 )>^|. (1.116) 

Finally, we have all estimates required to use Lemma 1.8: v = A/L 2 for some constant 
A and sin 2 [a(£i, £ 2 )/2] > (1 - \fe)/[A(L + 1)]. In addition, given Equations (1.108) 
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and (1.109), it is now easy to see that C\ fl £2 = {0} (as required by Lemma 1.8), since 
any state of the tensor product form \ip) pa <8> |7) c cannot live in C\. Plugging everything 
into Lemma 1.8, we conclude that in the NO case, the minimum eigenvalue of H is of the 
order fi((l — y^/L 3 ) (i.e. H has no "small" eigenvalues). As required by Definition 1.7, 
note that this lower bound is inverse polynomially separated from the upper bound on the 
smallest eigenvalue of H from the YES case if we first apply error reduction to V to bring 
e inverse polynomially close to 0. 

Is the Hamiltonian H 5-local? 

We have so far set up a Hamiltonian H whose eigenvalues are small or large, depending 
on whether we have a YES or NO instance of our QMA problem P, respectively. We now 
ask: Is H 5-local? 

The answer is almost. Recall that H G U{E® m <g> B® N - m <g> C i+1 ), where the counter 
register is C L+1 . If we implement the counter straightforwardly using O(logL) qubits, the 
resulting operations on it, such as incrementing the counter, could require updating all 
O(logL) qubits, making H (logL)-local at best. In order to circumvent this, Kitaev [171] 
uses a different representation for the counter for which any operation requires acting on 
at most 3 qubits of the counter. Specifically, we let H act on B® m <g) g®JV-m ^ g L ^ where 
the counter register is now given in unary, i.e. \j) G C L+1 is represented as 

|1,...,1,0,...,0>. (1.117) 

j 

The operator basis for £(C i+1 ) translates to this new representation as follows. 
Operator G £(C L+1 ) is mapped to |1)(1| J . <g> |0)(0| j+1 in the new space, i.e. being 

in state \j) in the old encoding is equivalent to having the jth qubit set to 1 and the 
(j + l)-th qubit set to in the new encoding. Similarly, operator \j — l)(j| is mapped to 
|l)(l|j _i ® |0)(lL <S> |0)(0| J+1 , i.e. if we think of \j — as moving us from state \j) to 
\j — 1), this is equivalent in the new encoding to flipping the jth bit to 0, followed by a 
safety check that qubits j — 1 and j + 1 are 1 and 0, respectively. The remaining basis 
elements are defined analogously. These operations are at most 3-local. Combined with 
the fact that H is based on the verification circuit V, which itself is composed of 2-qubit 
unitaries Vi, we have that H is 5-local Hamiltonian, as desired. 

With H being 5-local, there is one final issue to be addressed — since the counter is 
now represented using a larger space, one must deal with the possibility of invalid settings 
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to the counter register. To discourage such behavior, a fourth penalty term is added to H 
acting only on the counter space, namely 

L-l 



Hence, the new H is given by H = H m + H prop + H out + H stab . Note that H stah discourages 
counter states which are not of the form in Equation (1.117), i.e. states containing the 
subsequence 01 are given an energy penalty. 

Does the previous analysis of the smallest eigenvalue of H still hold when H sta b is added 
to the picture? The answer is yes. The YES case is easy to see, since all valid counter 
states are in the null space of H sta b- Thus, an honest proof receives no energy penalty from 
if stab, as desired. 

For the NO case, let S = B® m ®B® N ~ m <g>C L+1 (the original space we had defined H as 
acting on). Observe that H m + H prop + H out and if sta b both act invariantly on S, meaning 
they map operators in S to operators in S. Thus, we can split our analysis into two 
independent cases: when H acts on S, and when H acts on the orthogonal complement 
of S, denoted iS -1 . In the former case, i? s tab is just the zero operator with respect to 
S; thus, the previous eigenvalue analysis goes through unscathed, yielding an eigenvalue 
lower bound on H of fi((l — y^/L 3 ). As for the second case when H is restricted to 
observe that -£4t a b always administers an energy penalty, since S x contains only invalid 
counter states. Since H sts , h is a sum of commuting projectors, its eigenvalues will be non- 
negative integers — in particular, its smallest non-zero eigenvalue is at least 1. Since 
ifin + H prop + H out y 0, it follows that when restricted to we have H >z 1. Taking the 
minimum of the estimates for the two cases of S and S 1 - yields the desired bound that the 
smallest eigenvalue of H is still in — y/e)/L 3 ), despite the new representation for the 
counter. This concludes Kitaev's proof that 5-local Hamiltonian is complete for QMA. 

1.6 Quantum correlations 

As mentioned earlier, the growing field of quantum computation and information has posi- 
tively impacted both computer science and physics. The next area this thesis studies has in 
particular benefited greatly from this cross-fertilization, and is the study of quantum corre- 
lations. Here, we are interested in understanding correlations between individual quantum 
subsystems of a larger composite system. Specifically, we shall introduce and discuss two 
notions of quantum correlations: quantum entanglement and non-classical correlations. 




(1.118) 



47 



Motivation. We mention two reasons why the study of quantum correlations is impor- 
tant. The first is that the existence of certain correlations predicted by quantum theory, 
specifically quantum entanglement, has long troubled physicists. In a letter to Max Born 
in 1947, for example, Einstein dubs entanglement as "spukhafte Fernwirkung" , or "spooky 
action at a distance" [50]. This mentality was moreover the basis for the rejection of 
quantum mechanics as a complete physical theory a decade earlier by the famous Einstein, 
Podolsky, and Rosen (EPR) paper of 1935 [89]. Thus, a better understanding of quantum 
correlations appears to be key to understanding both the nature of our world around us, as 
well as our theories describing this world. The second reason is that quantum correlations 
are generally believed to be required for quantum computers to outperform their classical 
counterparts. It has been rigorously shown, for example, that in the pure-state setting, 
the amount of entanglement present in a quantum system must grow with the problem 
size if a quantum computation is to achieve an exponential speedup over classical comput- 
ers [160]. Thus, a better understanding of quantum correlations may prove advantageous 
for designing quantum algorithms, as well as for uncovering the boundary between classical 
and quantum computing. 

1.6.1 Quantum entanglement 

The canonical notion of quantum correlations between quantum systems dates back to 
the EPR paper of 1935 [ 9], and is called quantum entanglement. The name "entangle- 
ment" was coined by physicist Erwin Schrodinger, who used the term "Vershrankung" in 
1935 [221], which in colloquial "non-physicist" German means "folding of the arms" [61]. 
Much has been discovered in the field of entanglement theory over the last two decades, from 
its quantification and characterization, to its manipulation and use for quantum computa- 
tional and information theoretic tasks. In particular, what was once considered "spooky 
action at a distance" is now regarded as a valuable resource in quantum information (see, 
e.g. [ ]). In this thesis, entanglement is not a primary focus, but rather has important 
connections to non-classical correlations in the results of Chapters 7 and 8. We give a brief 
introduction to entanglement here; the reader is referred to the surveys of BruB [6 I ] and 
Horodecki® 4 [151] for further details. 

To begin, the canonical example of an entangled state is the two-qubit EPR pair, 



By observing that Tri(|0 + )(0 + |) = Tr 2 (|0 + )(0 + |) = 1/2, we have one of the characteristic 
traits of quantum mechanics — that for quantum systems, knowledge of the whole quantum 




(1.119) 
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system does not imply knowledge of its parts. Since entangled (pure) states, such as the 
EPR pair, cannot be written as a product state \ipi) <S> l^) of single qubit states \ipi), ^2), 
a primary area of study in quantum information has been the quantification of "how far" 
an entangled state is from product form. (Note that all classical states, by which we mean 
bit strings, are of product form.) 

The answer to this question varies greatly depending on context. For bipartite pure 
states \iPab) G P(C m ®C n ), the canonical measure of entanglement is given by the entropy 
of entanglement [151], 



where S(p) := — Tr(plog(p)) is the von Neumann entropy of p. It holds that < 
E(\i/jab)) < log(min(m, n)), where the lower bound is achieved if and only if a state is 
of product form, and the upper bound is achieved if and only if a state is maximally 
entangled, such as the EPR pair. 

The definition of EQi/jab)) is perhaps better motivated by the fact that any bipartite 
IV'ab) £ C m (g) C n can be written in terms of the Schmidt decomposition, such that 



Here, the real cti > are called Schmidt coefficients, and the sets {(V'i)} and are 
orthonormal bases for C m and C n , respectively, known as the Schmidt bases. The Schmidt 
decomposition is extremely useful in quantum information; some of our results in Chapter 2, 
for example, depend heavily on it. A proof of existence for the Schmidt decomposition is 
straightforward, and makes use of the vec mapping (defined in the proof of Corollary 7.8 
here) and singular value decomposition for operators; we refer the reader to [246] for details. 
To now see the connection between E and the Schmidt decomposition, let p G K min ( m > n ) 
with p(i) = af. Then, E(\i]j)) = H(p), where H(p) := — Y^iVif) l°gp(0 is the Shannon 
entropy of probability distribution p. In the other words, the more "tightly concentrated" 
the Schmidt coefficients of \i/jab) are, the less entangled |Vub) is. Note that a state is 
product if and only if it has a Schmidt coefficient a* = 1, and a state is maximally entangled 
if and only if all its Schmidt coefficients are l/y/d for d = min(m,n). 

Moving to the mixed state case, the quantification of entanglement becomes much more 
complex. Most generally, we say operator p G Pos {X <g> 3^) is separable (i.e. unentangled) 
if and only if it can be written [250] 



E^ab)) = S* (Tr £ I ipAB ) {tpAB I ) = S(Tia\^ab)(^ab\), 



(1.120) 




(1.121) 




(1.122) 
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for Ai G Pos(X) and Bi G Pos(3^)- This definition of separability (with the added trace 
one constraint) was first given by Werner [25 J]. We denote the set of separable operators 
acting on X £g> y as Sep (X ,y). Note that Sep (X, y) is a convex cone; this property is 
vital to the results of Section 4.5. Here, a cone is a set S C X such that Ax G S for all 
x G S and all A > 0. If additionally u + v G S for u,v £ S, then S" is called a convex cone. 
When we restrict ourselves to the set of separable density operators in Sep (X, y) (i.e. we 
impose the trace one constraint), we obtain a convex set. (A set 5* C X is called convex if 
px + (1 — p)y G S for all x,y £ S and < p < 1.) The set of separable density operators 
has the following properties, which prove useful in Section 4.5: It is compact and contains 
a ball around the maximally mixed state (which is, of course, separable) [124, 125, 126]. 

The problem of determining whether a given density operator p G T>{X ® y) is in 
Sep (X, y) (where one is allowed to work in time polynomial in the dimension), known as 
the Quantum Separability Problem, was shown NP-hard to solve within inverse exponential 
precision by Gurvits [123] (see also the work of Ioannou [151]). This was later extended to 
inverse polynomial precision by the present author [105], and shortly thereafter indepen- 
dently by Beigi [39]. Recently, a breakthrough result of Christandl, Brandao, and Yard [53] 
has shown that the problem is quasi-polynomial-time solvable for the case of constant pre- 
cision; the result goes via a powerful new de Finetti-type theorem for the Frobenius (and 
LOCC, where LOCC stands for local operations and classical correlations) norms. 

Thus, as suggested by the NP-hardness of Quantum Separability Problem, in the mixed- 
state case there is no known efficient test for separability, unlike the pure-state case. To 
this end, there have been many mixed state entanglement measures proposed to date; the 
reader is referred to the survey of Horodecki 04 [151] for an in-depth look. 

Here, we mention two entanglement detection schemes used in this thesis. The first 
is the popular approach proposed by Peres [ 06] known as the positive partial transpose 
(PPT) test, which plays a role in Chapters 7 and 8. Specifically, consider the super-operator 
I ®T acting on space C(X <g> y), where T denotes the transpose map. Then, given any 
p G T>(X (g) y), if (I <g) T)(p) ^ 0, then p is not separable. This follows since for any 
separable operator £V Ai ® B iy 

(/ <g> T) \ S2 A ® BA = ^ ® T ( B i) h 0. (1.123) 

Above, we have used the fact that the transpose map does not change the spectrum of an 
operator. The PPT test is known to be necessary and sufficient for pure states of all dimen- 
sions, and for mixed states of (2 x 2) and (2 x 3)-dimensional systems [206, 146]. In higher 
dimensions, however, we remark that there exist mixed entangled states which nevertheless 
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have a positive partial transpose; such states are called bound entangled [147, 144]. Bound 
entangled states have the property that they cannot be distilled, meaning roughly that in 
the asymptotic limit, given many copies of a bound entangled state p, there does not exist 
an LOCC (local operations and classical communication) protocol which can extract the 
entanglement present in the copies of p into pure EPR pairs. The quantification of just 
how much entanglement can be distilled in this sense is given by another entanglement 
measure, the distillable entanglement [210]; this makes a brief appearance in Chapter 7. 

Finally, there is an easy way to compute the partial transpose given a matrix represen- 
tation of state p G D(C m ® C n ): Namely, partition the matrix into (m x n)-dimensional 
blocks, and take the tranpose of each block individually. For example, for the EPR pair 
= (|00) + \l\))/y/2, we have 

(/®T)(|0+)(0 + |) = (/®T) 
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The second entanglement detection scheme we define here is the relative entropy of 
entanglement [238, 145]. Specifically, define for p, a G T>(X) the relative entropy as 

S(p\\a) := -Tr(p]oga) - S(p). (1.125) 

Then, for p G T>{X <g> y), the relative entropy of entanglement is defined as 

E R ( P )= min S(p\\a). (1.126) 

The following properties regarding Er hold [238]: It takes value if and only if p G 
Sep(X,y), is invariant under local unitary operations, is convex, reduces to the entropy 
of entanglement for pure states, and is an upper bound on the distillable entanglement 
(see also [212]). It is further non-increasing under LOCC, which follows since S(p||er) > 
S(§(p)\\§(a)) for any TPCP map $ [237]. In fact, a stronger and physically more rel- 
evant statement holds — that even if we allow post-selection after performing an LOCC 
measurement, the value of Er does not increase on average ['. 38]. In other words, let {K{\ 
be a complete set of Kraus operators for a TPCP map, i.e. £\ K\Ki = I. Then, letting 
Pi := KipKj, it holds that 

E R (P)>J2^(P^(^)- (1.121) 
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We close this section by noting that the definition of separability presented here extends 
straightforwardly to the multipartite setting. The structure of multipartite entanglement, 
however, is markedly more daunting than in the bipartite case. 

1.6.2 Non-classical correlations 

Having discussed quantum entanglement, we now turn our attention to another form of 
quantum correlations, called simply non-classical correlations. Such correlations have at- 
tracted much attention in the last decade or so, both in terms of their characterization and 
quantification, as well as with respect to their use as a resource in quantum information. 
We begin by motivating the study of non-classical correlations, and follow with definitions. 
We then discuss the role of such correlations in quantum information processing tasks, and 
close by surveying a number of known non-classicality measures. The reader is referred to 
Modi et al. [195] for a more comprehensive survey of the topic. 

Motivation. As mentioned earlier, it is known that in the case of pure-state quantum 
computation, entanglement is a necessary resource for exponential speedup over classical 
computers [160]. What happens, however, if we instead consider mixed-state quantum 
computing? This is a particularly relevant question, as typically one deals with mixed 
states in a laboratory setting due to noise from the environment. In 1998, Knill and 
Laflamme [171] proposed a model of computing known as Deterministic Quantum Com- 
puting with one clean qubit (DQC1) (see Chapter 5), wherein all but one qubit of the 
computation are initialized to the maximally mixed state — in other words, the quantum 
computation acts on a highly mixed state. (Note that this model is motivated experimen- 
tally by nuclear-magnetic resonance (NMR) information processing, in which states are 
highly mixed.) Yet, this model can perform the task of (normalized) trace estimation of 
a given unitary exponentially faster than the best known classical algorithm. This raises 
the question: Is entanglement also the root of the believed speedup in DQC1? (This is 
a natural question since "very highly mixed" states are separable due to a ball around 
the maximally mixed state in the set of separable quantum states [124, 125, 126].) Or 
are there other correlations possibly at play? Recent work has suggested that although 
Erwin Schrodinger once wrote that entanglement is "not just one of many traits, but the 
characteristic trait of quantum physics" [221] (as quoted in [195]), that between purely 
classical correlations and entanglement, there lies another form of quantum correlations 
whose nature is only now beginning to be understood. Such correlations are known simply 
as non-classical correlations. 
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Defining non-classical correlations. We now define what we mean by non-classical 
correlations. To begin, we say that a quantum state pab £ T>{X (&y), henceforth denoted 
as p to avoid clutter, is strictly classically correlated or classical if it can be diagonalized 
in a local product basis. In other words, p is classical if there exist local orthonormal bases 
(IV'z)}) f° r % an d y> respectively, such that 



for {Aj} the eigenvalues of p. Note that such a state is simply an embedding of a classical 
bipartite distribution into the quantum formalism. Any state not satisfying this definition is 
called non- classical. We remark that this definition of classicality extends straightforwardly 
to the multipartite setting. 

Continuing in the bipartite setting, a particularly interesting class of states which sub- 
sume the classical states are the so-called classical-quantum (CQ) states, which are only 
classical in system A. Specifically, a state p £ T>(X <g> y) is CQ if there exists a local 
orthonormal basis (iV'i)} f° r X such that 



for {p^ a probability distribution and for arbitrary pi £ T>(y). Note that system A in p 
simply plays the role of a classical label: Upon measuring it in basis (iV'i)} an d obtaining 
outcome i, we know the induced state pi in B. Also, observe that CQ states are separable. 
An analogous definition straightforwardly yields the similar class of quantum-classical (QC) 
states. As an aside, note that neither classical nor CQ states form a convex set, unlike the 
set of separable quantum states. 

Non-classical correlations and quantum information processing. A number of 
connections are known between non-classical correlations and quantum information pro- 
cessing tasks, involving for example local broadcasting [ )9, 188], extended state merg- 
ing [62], the locking of classical correlations [87, 78, 259, 49] (see Chapter 5), assisted 
optimal state discrimination [213, 181], remote state preparation [74], entanglement distri- 
bution [230, 68], and activation of non-classical correlations into entanglement [208] (see 
Chapter 7, and also related work by Streltsov, Kampermann and Brufi [231]). We now 
discuss two of these tasks: local broadcasting and entanglement distribution. 

We begin with the task of local broadcasting. Specifically, generalizing the no-cloning 
theorem of Section 1.4.5 is the following statement. Given a state p £ T>(X), we say 
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p' G T>(X ® X) is a broadcast state for p if 



Tn(p') = Tr 2 (p') = p, (1-130) 

where the 1 : 2 split is across the two copies of X. Now, suppose we are given a set of 
density operators {p^ C T>(X), and some arbitrary starting state a. Then, the statement 
we are interested in is that there exists a TPCP map A G T(X® X) which, for all i, achieves 
the mapping p^ g) a i-> p'j G <8> A?) for p^ a broadcast state for pj if and only if the pi 
pairwise commute. This is called the no-broadcasting theorem [37, 36]. With respect to 
non-classical correlations, a variant of this theorem is the no-local-broadcasting theorem of 
Piani et al. [ )9, 188], which states that for any bipartite state pab G V(X®y), there exist 
local TPCP maps B A G T(X, X®X) and Q B G T(y, y®y) such that Q A <g> ©b^b) is a 
broadcast state if and only if pab is strictly classical. Thus, the classicality of correlations 
in p is strongly tied to how well one can carry out the information theoretic task of local 
broadcasting. 

We next discuss the task of entanglement distribution. Consider a tripartite system 
ABC consisting of Alice, Bob, and a carrier system C. Roughly, the goal of entanglement 
distribution is for Alice and Bob to increase the entanglement between their systems A and 
B by having Alice send Bob the carrier system C. More specifically, we imagine Alice holds 
systems A and C to start, and Bob holds system B. Alice applies some encoding operation 
jointly to A and C. She then sends C to Bob. Bob finally applies some decoding operation 
to B and C. We now ask: Is the entanglement in the AC : B cut before the protocol 
was run strictly smaller than the entanglement in the A : BC cut after Bob receives the 
carrier C? What is perhaps most surprising about this task is that the answer to this 
question can be yes even if the carrier C is not entangled with A and B throughout the 
protocol [73]! Motivated by the question of whether non-classical correlations could be the 
resource behind this phenomenon, Streltsov et al. [230] and Chuan et al. [ ] (both works 
appeared concurrently and independently) showed that (definitions to follow) 



Ef lB (p ABC ) - E^ BC (a ABC ) < 8^{a ABC ), (1.131) 

where p AB c is the state before the protocol is run, a AB c is the state once Bob receives C 
from Alice, and where we measure non-classicality by the relative entropy of discord (RED) 
Sr B ^ C of Equation (1.140) (to be defined shortly) across the AB : C cut, and we measure 
entanglement by the relative entropy of entanglement (E^ 30 ) across the AC : B 

(A : BC) cut . In other words, the amount of entanglement which can be transferred from 
Alice to Bob is bounded by the amount of non-classical correlations between the carrier C 
and AB (after Alice has applied her encoding operation). Note thus that this upper bound 



AB\C , 
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can be non-zero even if C is unentangled with A and B throughout the protocol (and in 
fact must be non-zero for the example of Cubitt et al. [73] mentioned above). 

Quantifying non-classical correlations. Finally, we close this section by discussing a 
number of known non-classicality measures. 

The formal notion of CQ states first arose with the works of Ollivier and Zurek [! | 
and Henderson and Vedral [138], where a measure of quantum correlations dubbed the 
quantum discord was proposed. The aim of this measure is to quantify purely quantum 
correlations in a bipartite state p. To define the discord, recall first that the (classical) 
mutual information is a measure of correlation between (classical) random variables A and 
B, i.e. 

X(A : B) = H{A) + H(B) — H(A, B), (1.132) 

where H is the Shannon entropy defined in Section 1.6.1 and H(A,B) = — ^2 ab P^(A = 
a n B = b) logPr(A = a n B = b). Using the fact that Pi(B\A) = Pr(A n B)/Vt(A), one 
can straightforwardly also express the mutual information as 

J(A : B) — H{B) — H(B\A), (1.133) 

where H(B\A) := ^2 a P?{A = a)H(B\A = a). Although X and J are equivalent in 
the classical setting, their quantum counterparts no longer share the same relationship. 
Specifically, the quantum mutual information can be defined as 

l(p AB ) = S(p A ) + S(p B ) - S(pab), (1-134) 

where recall pA = T^b(pab)- However, a quantum variant of J is non-trivial to define, since 
it requires specifying a value for B for the conditional entropy H(A\B) — in particular, 
unlike the classical setting, quantumly the choice of measurement basis is non-trivial. 
To this end, for rank-one projective measurement {H^}, one defines [ ] a quantum 
conditional entropy 

S (p Bl {n?}) :=^(( n > /B W n i^V Pi )' (L135) 

j 

where pj = Tr(H^ <g> I B p). Then, a quantum version of J for given measurement basis 
{n^ 1 } can be defined as 

J{n$}{p) = s (Pb) ~ S (p B |{ n A}) • (1.136) 
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Note that J7| n ^j(p) quantifies the amount of classical correlations which can be extracted 
from pab via a projective measurement on one party; since we are in the end interested 
in purely quantum correlations, intuitively one would thus choose the optimum measure- 
ment {ihf } here so as to extract all purely classical correlations, leaving only quantum 
correlations behind. With this in mind, the quantum discord is now defined as 

S(p) := X(p) - max Jr U A\(p) = S(p A ) ~ S(p AB ) + min S (p ,r A ,) . (1.137) 

The discord is [195] non-negative, non-symmetric with respect to exchange of systems A 
and B, invariant under local unitaries, and most importantly for our discussion here, takes 
value zero if and only if p is CQ [ )3, 79]. Moreover, there exist separable states, such as 
the two-qubit state 

i|0><0|®|0)<0| + i|+>(+|®|l>(l|, (1-138) 

which have non-zero discord, thus showing that discord quantifies correlations beyond 
entanglement. (Aside: The state above is studied further in Chapters 6, 7, and 8.) 

The next measure of non-classical correlations we discuss is the geometric quantum 
discord [ ]. Let CQ C T>(X <S> y) denote the set of classical-quantum states. Then for 
p <EV(X ®y) the geometric discord is defined as 



8g(p) '■= m i n II P — °~ Ht? = mm 



;i.i39) 



where || ■ || F is the Frobenius norm and the second equality was shown by Luo and Fu [186]. 
The name geometric derives from the fact that the measure attempts to quantify distance 
from CQ via a metric. We have included the right-most expression in Equation (1.139) 
as it offers another intuitive interpretation of non-classical correlations involving distur- 
bance under measurement. Namely, recall that in the classical world, there always exists 
a choice of measurement basis {nf} (the computational basis) leaving the target state 
undisturbed. In the quantum setting, however, this is in general not the case. For exam- 
ple, this is an intuitive reason why CQ states are considered classical in A; there exists 
a measurement basis acting invariantly on A. The second expression for 5c{p) in Equa- 
tion (1.139) thus attempts to understand how much p must be disturbed in a (rank one 
projective) measurement, regardless of the choice of local measurement basis for A. 

The next non-classicality measure we discuss is similar to the geometric discord, but 
replaces the Frobenius norm with the relative entropy. We thus arrive at the relative 
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entropy of discord (RED) [196], 



Sr(p) 



min S(p\ \o~). 



(1.140) 



An analogous definition for the case of general strictly classically correlated states goes 
under the name of the relative entropy of quantumness (REQ) [54, 187, 118, 217, 196]; this 
is studied further in Chapters 7 and 8. 

Interestingly, the RED turns out to be equal to (a variant of) another measure of non- 
classical correlations we discuss next, the quantum deficit [145]. The latter's definition is 
motivated by work extraction from quantum systems coupled to a heat bath. Roughly, the 
idea here is that a state is strictly classically correlated if and only if the same amount of 
work can be drawn from the global state versus from the local subsystems after allowing a 
suitably restricted subset of local operations and classical communication (LOCC) known 
as closed LOCC. The variant of the deficit which is equal [145] to the RED is the one-way 
deficit A™*, given by (simplified from the original definition): 



Here, {n^} again denotes a rank-one projective measurement. The correspondence be- 
tween RED and the deficit does not stop here, however; the two-sided analogue of the 
RED, the REQ, is equal [145] to the so-called zero-way deficit A : 



We have discussed a number of non-classicality measures here. Later in Chapter 6, we 
introduce a novel measure of non-classical correlations based on local unitary operations, 
which for (2 x iV)-dimensional quantum states turns out to coincide with the geometric 
discord. Chapters 7 and 8 then introduce and study a protocol for "activating" non- 
classical correlations into entanglement, while also providing an operational interpretation 
for the REQ. 




(1.141) 




(1.142) 
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Chapter 2 



Approximation algorithms for 
QMA-complete problems 

This chapter is based on [108]: 

S. Gharibian and J. Kempe. Approximation algorithms for QMA-complete problems. In 
Proceedings of 26th IEEE Conference on Computational Complexity, pages 178-188, 2011, 
DOI: 10.1109/CCC.2011.15, © 2011 IEEE, ieeexplore.ieee.org. 

Approximation algorithms for classical constraint satisfaction problems are one of the main 
research areas in theoretical computer science. In this chapter, we define a natural approx- 
imation version of the QMA-complete local Hamiltonian problem and initiate its study. 
We present two main results. The first shows that a non-trivial approximation ratio can 
be obtained in the class NP using product states. The second result (which builds on the 
first one), gives a polynomial time (classical) algorithm providing a similar approximation 
ratio for dense instances of the problem. The latter result is based on an adaptation of the 
"exhaustive sampling method" by Arora et al. [ ] to the quantum setting, and might be 
of independent interest. 

2.1 Introduction and results 

In the last few years, the quantum analog of the class NP, the class QMA [171], has 
been extensively studied, and several QMA-complete problems have been found [182, 55, 
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184, 40, 215, 158, 223, 253]. Arguably the most important (and historically first) QMA- 
complete problem is the fc-local Hamiltonian problem [171, 164, 202, 163, 20]. Recall from 
Section 1.5.4 that here, the input is a set of Hamiltonians (Hermitian matrices), each acting 
on at most /c-qubits each. The task is to determine the largest eigenvalue of the sum of 
these Hamiltonians. This problem generalizes the central NP-hard problem MAX-/c-CSP, 
where we are given a set of Boolean constraints on k variables each, with the goal to 
satisfy as many constraints as possible. The local Hamiltonian problem is of significant 
interest to complexity theorists and to physicists studying properties of physical systems 
alike (e.g. [60, 15, 58, 17, 69, 176, 222]). 

Moving to the classical scenario, the theory of NP-completeness is one of the great 
success stories of classical computational complexity [27]. It was soon realized that many 
natural optimization problems are NP-hard, and are hence unlikely to have polynomial time 
algorithms. A natural question (both in theory and in practice) is to look for polynomial 
time algorithms that produce solutions that are close to optimum. More precisely, one says 
that an algorithm achieves an approximation ratio of c G [0, 1] for a certain maximization 
problem if on all inputs, the value of the algorithm's output is at least c times that of 
the optimum solution (the output value should also be at most the optimal solution). 
The closer c is to 1, the better the approximation. The investigation of approximation 
algorithms is, after decades of heavy research, still a very active area (e.g., [141, 236]). 
For many central NP-hard problems, tight polynomial time approximation algorithms are 
known. 

In the context of QMA-complete problems, it is thus natural to search for approxima- 
tion algorithms for these problems, and in particular for the local Hamiltonian problem. 
The question we address here is: How well can one efficiently approximate the k-local 
Hamiltonian problem? 

It should be noted that a large host of heuristics has been developed in the physics 
community to approximate properties of local Hamiltonian systems (see, e.g., [69] for a 
survey) and this area is extremely important in the study of physical systems. However, 
the systematic complexity theoretic study of approximation algorithms for QMA-complete 
problems is still very much in its infancy, and our work is one of the first steps in this 
research direction. We note that there has been a lot of interest in recent years [17, 6] in 
establishing a so-called quantum PCP theorem [30, 29], which amounts to showing that for 
some constant c < 1 close enough to 1, approximating the fc-local Hamiltonian (or related 
problems) to within c is QMA-hard. Our results can also be seen as a natural continuation 
of that investigation. 
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Our results: Let us start by precisely defining the optimization version of the local 
Hamiltonian problem, which is parameterized by two integers k and d, which we always 
think of as constants. Note that the definition below differs slightly from that given in 
Section 1.5.4, Definition 1.7; we discuss the differences after stating the definition. 

Definition 2.1 (MAX-A;-local Hamiltonian problem on d-level systems (qudits)). An in- 
stance of the problem consists of a collection of m Hermitian matrices, one for each subset 
of k qudits. The matrix i/i lr „,i fc corresponding to some 1 < i\ < ■ ■ ■ < < n is assumed 
to act on those qudits (terms acting on less than k qudits can be incorporated by tensoring 
them with the identity), to be positive semidefinite, and to have operator norm at most 1. 
We call any pure or mixed state p on n qudits an assignment and define its value to be 
Tr (Hp) where H = i Hi lt ,„j k . The goal is to find the largest eigenvalue of H (de- 

noted OPT ), or equivalently, the maximum value obtained by an assignment. We say that 
an algorithm provides an approximation ratio of c G [0, 1] if for all instances, it outputs a 
value that is between c ■ OPT and OPT. 

This definition, we believe, is the natural quantum analog of the MAX-fc-CSP problem. 
We note that it differs slightly from the usual definition of the fc-local Hamiltonian problem. 
Namely, we consider maximization (as opposed to minimization), and also restrict the 
terms of H to be positive semidefinite, and have norm at most 1 (the latter two contraints 
are also common to Definition 1.7; more generally, the local terms of H can be arbitrary 
Hermitian operators). As long as one considers the exact problem, these assumptions are 
without loss of generality, and do not affect the definition, as seen by simply scaling the 
Hamiltonians and adding multiples of identity as necessary. However, when dealing with 
the approximation version, these assumptions are important for the problem to make sense; 
for instance, one cannot meaningfully talk about approximation ratios if the optimum 
can take both negative and positive values. That is why we require the terms to be 
positive semidefinite. The requirement that the terms have operator norm at most 1 
does not affect the problem and later allows us to conveniently define dense instances. 
Finally, changing the maximization to a minimization would lead to an entirely different 
approximation problem: the quantum analogue of MIN-CSP (e.g. [167]). Minimization 
problems are, generally speaking, harder than maximization problems, and we leave this 
research direction for future work. 

Before stating our results, we state a trivial way to get a <i _fc -approximation for MAX- 
fc-local Hamiltonian. Observe that the maximally mixed state has at least d~ k overlap with 
the reduced density matrix of the optimal assignment on any k particles. A similar property 
holds classically, where a random assignment gives (in expectation) a d~ k approximation 
of MAX-A;-CSP. We now describe our two main results. 
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Approximation by product states. One inherently quantum property of the local 
Hamiltonian problem is the fact that the optimal state might in general be highly entangled 
(and hence not efficiently describable in polynomial time or space). This is why we do not 
require outputting the assignment itself in the above definition. If, however, the optimal 
assignment (or some other good assignment) was guaranteed to be a product state, then 
we could describe it efficiently. The following theorem shows just that. 

Theorem 2.2. For an instance of MAX-k-local Hamiltonian with optimal value OPT, 
there is a (pure) product state assignment that has value at least OPT/G? fc_1 . 

This result is tight for product states in the case of 2-local Hamiltonians (we remark 
that 2-local Hamiltonians are often the most relevant case from a physics perspective). For 
example, consider the Hamiltonian on 2-qubits that projects onto the EPR state ^(|00) + 
1 11)). It is easy to see that no product state achieves value more than 1/2. For general 
d and k, we can only show that product states cannot achieve an approximation ratio 
greater than l/c^/ 2 ! (see Section 2.2, where better bounds in more specific cases are also 
discussed) . 

If we could efficiently find the best product state assignment, we would obtain an 
algorithm achieving a non-trivial d~ k+1 approximation ratio. Unfortunately, this problem 
is NP-complete, since it would allow one to solve (e.g.) the special case of MAX-fc-SAT 
(as discussed in Section 1.5.4, for each clause C acting on variables {zi,...,Zfe} in an 
instance of MAX-fc-SAT, define the corresponding Hamiltonian term i?^,...,^ diagonal in the 
computational basis and projecting onto the satisfying assignments for C . Then, without 
loss of generality, the optimal product state assignment can be taken to be a computational 
basis state), implying such an algorithm cannot exist unless P = NP. Still, the theorem 
has the following interesting implication: It shows that unless NP = QMA, approximating 
the local Hamiltonian problem to within a factor less than d~ k+1 is not QMA-hard. This 
follows simply because product states have polynomial size classical descriptions. (More 
accurately, since one uses a polynomial number of classical bits to approximately specify a 
product state in NP, the ratio in the implication above is d~ k+1 — f(m) for some function 
/ which scales inverse exponentially in the input size m.) 

A polynomial time approximation algorithm for dense instances. Our second 
result gives a classical polynomial time approximation algorithm for dense instances of the 
local Hamiltonian problem. This result is perhaps our technically most challenging one, 
and we hope the techniques we develop might turn out useful elsewhere. 
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Dense instances of classical constraint satisfaction problems have been studied in depth 
[81, 101, 115, 28, 82, 23, 38, 83]. Our result is inspired by work of Arora et al. [28] 
who provide a polynomial time approximation scheme, or PTAS (i.e., an efficient 1 — 
e approximation algorithm for any fixed e > 0), for several types of dense constraint 
satisfaction problems. In the classical case, dense (for 2-local constraints) simply means 
that the average degree in the constraint graph is Q(ri), or equivalently, that the optimum 
is Q(n 2 ). In analogy, we define an instance of MAX-fc-local Hamiltonian to be dense if 
OPT = Q(n k ), or equivalently, if Tr(H-L) = £l(n k ) (the equivalence follows from the fact 
that the mixed state assignment I/d n has value between OPT and OPT/d k ). 

It is not hard to see that the (exact) dense local Hamiltonian problem remains QMA- 
hard (see Section 2.3.3). We hope the dense case might be of practical interest to physicists 
who study systems of particles by incorporating all possible interactions between them. Our 
second main result is the following: 

Theorem 2.3. For all e > there is a polynomial time (l/d k ^ 1 — e)- approximation algo- 
rithm for the dense MAX-k-local Hamiltonian problem over qudits. 

Theorem 2.3 follows immediately by combining Theorem 2.2 with the following theorem, 
which gives an approximation scheme for the problem of optimizing over the set of product 
states. 

Theorem 2.4. Let OPTp denote the value of the optimal product state assignment for an 
instance of MAX-k-local Hamiltonian H . Then, for all e > 0, there is a polynomial time 
algorithm which outputs a product state assignment attaining value at least OPTp — en k . 
For all e > 0, this yields an efficient (1 — e)- approximation algorithm for computing OPTp 
for dense MAX-k-local Hamiltonian. 

We remark that the algorithm of Theorem 2.4 also applies in the minimization setting, 
in which one is interested in computing the smallest eigenvalue of /c-local Hamiltonian H. 
Here, our algorithm outputs a value at most OPTp + en k . 

Proof ideas and new tools: The proofs of Theorem 2.2 and Theorem 2.4 are indepen- 
dent and employ different techniques. To show the product state approximation guarantee, 
we show a slightly stronger statement: For any assignment there is a way to construct 
a product assignment of at least d~ k+1 its value. The proof is constructive (given we 
use a type of recursive Schmidt decomposition of to obtain a mixture of product states 
whose value is off by at most the desired approximation factor (see Section 2.2). 
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Our second result is technically more challenging and introduces a few new ideas to 
this problem, inspired by work of Arora et al. [28] in the classical setting. We illustrate 
the main ideas for MAX-2-local Hamiltonian on n qubits. Recall that our goal is to find 
a PTAS for the local Hamiltonian problem over product states. The value of the optimal 
product state assignment, OPTp, can be written 

n 

OPTp = max Tr(Hij(pi <8> Pj)) s.t. pi >z and Tr(pj) = 1 for 1 < i < n, 

i=l i6 jv(i) 

(2.1) 

where N(i) is the set of indices j for which a local Hamiltonian term fly is present. We 
might call this a quadratic semidefinite program, as the maximization is quadratic in the 
Pi (and as such not efficiently solvable in general). Note, however, that if the terms in the 
maximization were linear ; then we would obtain a semidefinite program (SDP), which is 
efficiently solvable [121]. To "linearize" our optimization, we use the "exhaustive sampling 
method" developed by Arora et al. [ ] (a method which was later key in many devel- 
opments in property testing, e.g. [115]). We write each Hamiltonian term in a basis that 
separates its two qubits, for instance the Pauli basis {<7o, <J\, cr 2 , 0-3}, fly = Y^l j=o a% ki°~k®o-i. 
For i — 1, . . . , n and k — 0,1, 2, 3, define 

<* : = E E^TrKo,). (2.2) 

jeN(i) 1=0 

If we knew the values of c\ for the optimal pi, then solving the SDP below would yield the 
optimal pi. 

n 3 

max c\Tr(o- k pi) s.t. p,- L y and Tr(pj) = 1 for 1 < i < n, (2.3) 

i=l k=0 

3 

a^Tr(cr/Pj) = c\ for 1 < i < n and < k < 3. 

j'eiv(i) 1=0 

Of course, this reasoning is circular, as in order to obtain the c\ we need the optimal p;. 
The crucial idea is now to use sampling to estimate the c l k . More precisely, assume for a 
second that we could sample O(logra) of the pi randomly from the optimal assignment. 
Then, by standard sampling bounds, with high probability over the choice of the sampled 
qubits we can estimate all the d k to within an additive error ±en for some e. If we had 
these estimates a\ for the c l k , we could solve the SDP above with the slight modification 
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that the last constraint should be a\ — en < J2jeN{i) a kiT T (&iPj) < ajt + en. With 
high probability over the sampled qubits, this SDP will give a solution that is within an 
additive en 2 of the optimal one (more subtle technicalities and all calculations can be found 
in Section 2.3). Moreover, it is possible to derandomize the sampling procedure to obtain 
a deterministic algorithm (Section 2.3.3). 

Of course, we are still in the realm of wishful thinking, because in order to sample 
from the optimal solution, we would need to know it, which is precisely what we set out 
to do. However, the number of qubits we wish to sample is only logarithmic in the input 
size. Thus, to simulate the sampling procedure, we can pick a random subset of O(logn) 
qubits, and simply iterate through all possible assignments on them (with an appropriate 
5-net over the density matrices, which incurs a small additional error) in polynomial time! 
Our algorithm then runs the SDP for each iteration, and we are guaranteed that at least 
one iteration will return a solution within en 2 of the optimal one. Because the denseness 
assumption guarantees that OPTp is Q(n 2 ), our additive approximation turns into a factor 
(1— ^-approximation, as desired. All details, the runtime of the algorithm and error bounds 
for the general /c-local case on qudits are given in Section 2.3. We remark that the approach 
above works analogously in the setting where the objective function involves minimization 
instead of maximization. 

Previous and related work: We note that many heuristics have been developed in 
the physics community to approximate properties of local Hamiltonian systems and this 
area is extremely important in the study of physical systems (e.g. [255, 256, 205, 214, 220, 
)7, 69, 204]). Our focus here is, however, on rigorous bounds (unlike a heuristic) on the 
approximation guarantee of algorithms for the general problem (we allow interactions of 
arbitrary types occurring on arbitrary graphs, in contrast to the more common approach 
of studying specific local Hamiltonian models with certain classes of allowed interactions). 
In this area, to our knowledge, few results are known. In the setting of relative-error 
approximation, as studied here, the first and only previous result we are aware of is that of 
Bansal, Bravyi and Terhal [35] , who give a PTAS for a special case of the local Hamiltonian 
problem, so called quantum Ising spin glasses, for the case where the instance is on a planar 
graph and of bounded degree. Roughly, this PTAS is obtained by dividing the graph into 
constant size chunks, which can be solved directly, and ignoring the constraints between 
chunks (this incurs an error proportional to the number of such constraints, which is 
small because the graph is planar). In the setting of absolute-error approximation, in ID 
models, rigorous results such as Hasting's ID area law are known for gapped systems [133] 
(where it is also shown that the ground state is well-approximated by a Matrix Product 
State [240]), and rigorous approximation methods are known for ID [16, 2 I] and for 2-local 
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Hamiltonians on qubits where the two-qubit interaction strengths are weak [57]. Finally, 
we remark that the use of a product state ansatz is closely related to the mean-field 
approximation or Hartree-Fock method in physics (see, e.g. [90]). 

Discussion and open questions: Our two results give approximations to the local 
Hamiltonian problem. Although at first glance, our approximation ratio of l/d k ~ 1 may 
appear an incremental improvement over the trivial random assignment strategy, there are 
three important notes that should be kept in mind: The first is that many classical NP-hard 
problems, such as MAX-3-SAT (a special case of M AX-fc-CSP where each constraint is the 
disjunction ("OR") of k variables or their negation), are approximation resistant (e.g. [132, 
2]), meaning that unless P=NP, there do not even exist non-trivial approximation ratios 
beyond the random assignment strategy. For example, for MAX-3-SAT it is NP-hard 
to do better than the approximation ratio of 7/8 achieved by random assignment [131]. 
Thus, showing the existence of a non-trivial approximation ratio is typically a big step 
in the classical setting. Moreover, it could have been conceivable that for MAX-fc-local 
Hamiltonian, analogously to MAX-3-SAT, outperforming the random assignment strategy 
would have been QMA-hard. Yet our results show that unless NP=QMA, this is not the 
case. The second important note that should be kept in mind is that our work considers 
the local Hamiltonian problem in its full generality by allowing arbitrary constraints on 
an arbitrary interaction graph. It could be (and is the case, for example, in [35]) that 
for more restricted classes of local Hamiltonian models, better approximation ratios are 
achievable. Third, the currently best approximation algorithm for MAX-fc-CSP gives an 
approximation ratio of only about 0.44/c/2 fc for k > 2 [63] (for k = 2, one can achieve 
0.874 [180]. See also the work of Raghavendra [ ]) and this is, moreover, essentially the 
best possible under a plausible complexity theoretic conjecture (namely, the Unique Games 
Conjecture [168]) [234, 130, 218, 32]. This is to be contrasted with our 2/2 fc -approximation 
ratio for the case of d = 2 (i.e. qubit systems), which we show can be achieved by product 
state assignments for arbitrary (i.e. even non-dense) MAX-fc-local Hamiltonian instances 
(in the non-dense case, however, we do not show how to efficiently find a product state 
achieving this ratio). This raises the important open question: Is our approximation ratio 
tight? 

Our product state approximation shows that approximating the local Hamiltonian prob- 
lem to within d~ k+1 is in NP. It would be interesting to know if this approximation ratio 
could also be achieved in polynomial time. If not, it might lead to an intriguing state of 
affairs where for low approximation ratios the problem is efficiently solvable, for medium 
ratios it is in NP but not efficiently solvable, and for high ratios it is QMA-hard (assuming 
a quantum PCP theorem exists). Further, as mentioned earlier, our work can be viewed as 
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negative progress towards a quantum PCP theorem in that, by Theorem 2.2, a quantum 
PCP theorem with hardness ratio c < dr k+l cannot exist unless NP=QMA. 

To obtain our results for the case of dense local Hamiltonians, we have introduced the 
exhaustive sampling technique of Arora et al. [28] to the setting of low-degree semidefinite 
programs. We linearize such programs using exhaustive sampling in combination with a 
careful analysis of the error coming from working with <5-nets on density matrices. We 
remark that it seems we cannot simply apply the results of [ ] for smooth Polynomial 
Integer Programs as a black-box to our setting. This is due to our aforementioned need 
for a 5-net, as well as the requirement that our assignment be a positive semidefinite 
operator. We address the latter issue by extending the techniques of [28] to the realm of 
positive semidefinite programs by introducing the notion of "degree-fc inner products" over 
Hermitian operators to generalize the concept of degree-fc polynomials over real numbers, 
and performing the more complex analysis that ensues. We hope that this technique will 
be of much wider applicability, particularly considering the growing use of semidefinite 
programs in numerous areas of quantum computing and information (e.g. [88, 155, 178]). 

Another open question is whether similar ideas can be used to approximate other QMA- 
complete problems, such as the Consistency problem [ 82]. Moreover, can we obtain poly- 
nomial time algorithms without the denseness assumption? And are there special cases of 
the local Hamiltonian problem for which there is a PTAS (other than for planar Ising spin 
glasses [35])? Of course, we do not expect a PTAS for all instances of the local Hamiltonian 
problem, as this would contradict known hardness results for special classical cases of the 
problem. However, perhaps there exist other classes of physically relevant instances of the 
problem for which a PTAS does exist. Finally, can our scheme be extended to work with 
more general classes of quantum assignments than product states, such as Matrix Product 
States [240]? 

Organization of this chapter: In Section 2.2, we prove our result on product state 
approximations (Theorem 2.9 and the ensuing proof of Theorem 2.2), show its tightness in 
the 2- local case and provide the upper bound of g? — L fc / 2 J f or the best possible approximation 
by product states. Section 2.3 gives our polynomial time approximation algorithm and 
develops the general sampling and SDP-based technique we use. It also shows that the 
dense local Hamiltonian problem remains QMA-complete. As some of the proofs and 
notation of Section 2.3 are rather technical, we have deferred the full proofs of this section 
to Section 2.4 in order to facilitate reading. 
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2.2 Product states yield a 1/d ^approximation for 
qudits 

We now show that product state assignments achieve a non-trivial approximation ratio for 
MAX-fc-local Hamiltonian, i.e. Theorem 2.2. To do so, we first define the recursive Schmidt 
decomposition (RSD, Definition 2.5) of a state G (C d )® n , and for ease of exposition, 
the corresponding notion of a Schmidt cut (Definition 2.6). We then state and prove the 
key to our approach, the Mixing Lemma (Lemma 2.7), which shows how to use the RSD 
to eliminate the entanglement across a particular Schmidt cut of while maintaining 
the desired approximation ratio. Lemma 2.8 and Theorem 2.9 then expand on this by 
showing how to apply the Mixing Lemma to multiple Schmidt cuts. From Theorem 2.9, 
a proof of Theorem 2.2 easily follows. We close with a discussion of the tightness of the 
approximation ratio given by Theorem 2.2. 

We first define the terms Recursive Schmidt Decomposition and Schmidt cut. 

Definition 2.5 (Recursive Schmidt Decomposition (RSD)). Given a state G (C d )® n , 
we define its recursive Schmidt decomposition as the expression obtained by recursively 
applying the Schmidt decomposition on each qudit from 1 to n — 1 inclusive. More formally, 
we define the RSD of as follows: 

• (Base case) If n = 1, then RSD(|-?/>)) = 

• (Recursive case) If n > 1, then RSD(|^>) = £ii ai|V>i)®RSD(|&)), where |^> G C d , 
\4>i) G (C^)®" --1 , Yli=i a i = 1; (l^i)} i s an orthonormal basis for the first qudit of 

and {|0j)} is a set of orthonormal vectors for the remaining n—1 qudits of 

(This definition is relative to some fixed ordering of the qudits. The specific choice of 
ordering is unimportant in our scenario, as any decomposition output by such a process 
suffices to prove Theorem 2.2.) For example, the RSD for 3-qubit is 

= ® Wx\h)\cx) + A,|&2>|02)) + « 2 |a 2 ) ® WtWx) + (3' 2 \b' 2 )\c' 2 )), (2.4) 

for a\ + al = j3\ + /3| = f3[ 2 + j3'\ = 1, {|ctj)}j an orthonormal basis for qubit 1, and 
{Wi)}i orthonormal bases for qubit 2, and {|cj)} i and {Ic'j)}^ orthonormal bases for qubit 
3. 

Definition 2.6 (Schmidt cut). For any \ip) G (C d )®" with Schmidt decomposition \ip) = 
Y,t=i®i\wi)\vi), where on G R with ^af = 1, \w t ) G C d and G (C d )®™-\ and for 
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any \<p) G (C d )® m , we refer to the expansion \<j>) <g> (X/»=i ^1^)1%)) as the Schmidt cut at 
qudit m + l. We say that a projector II crosses this Schmidt cut if II acts on qudit m + 1 
and at least one qudit i G {m + 2, . . . , m + n}. 

The heart of our approach is the following Mixing Lemma, which provides, for any as- 
signment \if}) G (C d )® n , an explicit construction through which the entanglement across the 
first Schmidt cut of can be eliminated, while maintaining at least a (1 /^-approximation 
ratio relative to the value achieves against any local Hamiltonian H G "H((C d )® n ). 

Lemma 2.7 (Mixing Lemma). Given state on n qudits with Schmidt cut on qu- 
dit 1 given by = Ylt=i a i\ w i) \ v i)> where a ; 6 M with ^\ of = 1, |u>j) G C d and 
\vi) G (C d ) 0n_1 ; define p := J2t=i a i\ w i)( w i\ ® \ v i)( v i\- Then, given projector II acting on 
some subset S of the qudits, if IT crosses i/ie Schmidt cut, then Tr(IIp) > 4Tr(II|^')('0|). 
Otherwise, Tr(LTp) = Tr(IT|-?/>)(?/>|). 

Proof. Case 2 follows easily by noting that the given Schmidt decomposition of implies 
Ttx(p) = Tri(|^)(^|) and Tr 2) ... )ri (p) = Tr 2) ... )n (|"0)(V'l)- To prove case 1, we observe by 
straightforward expansion that 

Tr(n|^|) = Tr(ITp) + ^ aiaj(wi\(vi\Il\wj)\vj) + a i aj(wj\(v j \Il\w i )\vi) . (2.5) 

i<j 

Then, by defining for each i vector |a«) := ajl\wi)\vi) , we have 

oijOij{wi\{vi\U\wj)\vj) + OiOij{wj\{vj\U\wi)\vi) = y^djlaj) + (aj\ai), (2.6) 

i<j i<j 

since n 2 = n. Applying the fact that (a\b) + (b\a) < \\ \a) \\ 2 2 + || |6) \\\ for \a), \b) G (C d )® n 
thus implies 

22( a i\ a j)+( a 3\ a i) - / , II K) II2+II l a i) II2 = {d-l)^^al{wi\{vi\U\wi)\vi) = (d-l)Tr(IIp), 

i<j i<j i 

(2.7) 

from which the claim follows. □ 

The following simple extension of Lemma 2.7 simplifies our proof of Theorem 2.9. 

Corollary 2.8. Define := \<f>) <g> \ip), where \<f>) G (C d ) 0m form > and j^) zs defined 
as in Lemma 2.7, and let p G V(C d )® n be obtained from as in Lemma 2.7. Then, for 
any projector U acting on a subset S of the qudits, if U crosses the Schmidt cut of 
at qudit m + l, we have Tr(II|0)(0| <g> p) > ^Tt(U\i/j')(^'\). Otherwise, Tr(II|0)(0| ® p) = 
Tr(n|^)(^l). 
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Proof. Immediate by applying the proof of Lemma 2.7 with the following modifications: 
(1) Define |a$) := aill\(j))\wi) \vi), and (2) if <S C {1, . . . , m} U {m + 2, . . . , m + n) (i.e. this 
is one of two ways for II not to cross the cut — the other way is for S C {1, . . . , m + 1}), 
observe that by the same arguments as in Lemma 2.7 for case 2 and the product structure 
between \<j>) and \ip) in \ip') that Tr m+1 (|0)(0| ® p) = Tr m+1 (|^') □ 

Lemma 2.7 shows that the state p obtained by mixing the d Schmidt vectors of \ip), as 
opposed to taking their superposition, suffices to achieve a (l/d)-approximation across the 
first Schmidt cut. By iterating this argument over alln — 1 Schmidt cuts, we now prove that 
a mixture of all (product) states appearing in the RSD of \ip) achieves an approximation 
ratio of l/d k ~ l . 

Theorem 2.9. For any n-qudit assignment \tp) with RSD = Ylt=i v/Pil'/'i); where 
^2iPi = 1 and {\<j>i)}i = i is a set of orthonormal product vectors in (C d )® n , define p : = 
Sf=i Pi\4>i){4>i\- Then, for any projector II acting on some subset S C {1, . . . ,n} of audits 
with \S\ = k, we have Tr(IIp) > ^iTr^l^)^!). 

Proof. Let II be a projector with |<S| = k, and define c e {0,l} n_1 such that c(j) = 1 
iff II crosses the Schmidt cut at qudit j. For example, if II acts on qudits {1,2}, then 
c = (1,0,..., 0). Note that in general 1 1 c 1 1 x = k — 1. Let \ipk) denote the expression 
obtained by taking the RSD of \tp) up to the kth level of recursion for 1 < k < n — 1, i.e. 

I^jfe) can be written 

d k 

ivo = Yl a ¥D ® ■ ■ ■ ® W) ® ( 2 - 8 ) 

i=l 

where \^{) G C d and |<&) G (C d )®"" fc . (We assume n > 2, as otherwise the claim is 
vacuously true.) Corresponding to \ipk), define 

d k 

p^ := J2°Z\ti)(ti\ ® • • " ® |V»f>W| ® (2-9) 
i=i 

Define := 5^ i=1 c(i). To prove our claim, we show by induction that for all 1 < k < n— 1, 
it holds that 

Tr(II|V>}(V>|) < rf Cfc Tr(np (fc) ). (2.10) 
Note that the case k = n — 1 is in particular the case we are interested in. 
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For the base case, let k = 1. Consider first the Schmidt cut of \ip) at qudit 1, i.e. 
Wi = TLiViWDlfa)* for Ml) E £d and l&) E (C d )® n_1 . Then, recalling that = 
Yli=i a l\' l Pi}{' l Pi\ ® \4>i) (4>i\i we have by Lemma 2.7 that 

Tr(n|^)(^|) < c/ c(1) Tr(np (1) ), (2.11) 

as desired. 

For the inductive step, assume the inductive hypothesis holds for some 1 < k < n — 2. 
We prove the claim holds for k + 1. Note that by Equation (2.10), which holds due to the 
induction hypothesis for our specific value of k, it suffices to show that 

Tr(n P (fc) ) < cf (fc+1) Tr(IIp (fc+1) ), (2.12) 

since d Ck+c ^ h+1 ^ = d Ck+1 . To show this holds, consider the ith term in Equation (2.9), 
l^Di^ll ® ■ ■ ■ ® I^X^I ® f° r arbitrary 1 < i < d k . Observe this term satisfies 

the preconditions for Corollary 2.8 with m = k. Hence, via Corollary 2.8 there exists a 
state <7j acting on qudits {k + 1, . . . , n} such that 

Tr^l^XV-l ® K fe >W| ® |0iX0<|) < ^^(nl^XV-l ® K fe >W| 

(2-13) 

Moreover, since cij in Corollary 2.8 is obtained via the Mixing Lemma (Lemma 2.7), by 
linearity we can express p( fc+1 ) as 

p {k+1) = J2 a MM\ ® •••« ) W>WI ® ^- ( 2 - 14 ) 

i=l 

We conclude by linearity that Equation (2.12) holds, completing the proof. □ 

With Theorem 2.9 in hand, we can now show Theorem 2.2, i.e. that product states 
achieve approximation ratio \jd k ~ x . 

Proof. (Theorem 2.2) Simply apply Theorem 2.9 to each projector in the spectral de- 
compositions of each (positive semidefinite) Hi in our MAX-fc-local Hamiltonian instance 
H = J2i Hi, and let \ip) denote the optimal assignment for H. It is important to note 
that we can exploit Theorem 2.9 in this fashion due to the fact that the p constructed by 
Theorem 2.9 is independent of the projector II — i.e. for any fixed \ip) and k, the state 
p provides the same approximation ratio against any fc-local projector II encountered in 
the spectral decompositions of the Hi. Finally, note that one can find a pure product 
state achieving this approximation guarantee since p is a convex mixture of pure product 
states. □ 



70 



I ft I 

Upper bound of rf ^J for product state approximations. Is the result of Theo- 
rem 2.2 tight? In the case of MAX-2-local Hamiltonian on qudits, yes — consider a single 
clause projecting onto the maximally entangled state 4= ^ \ii), for which a product state 
achieves value at most 1/d. On the other hand, for MAX-3-local Hamiltonian on qubits, 
the worst case clause for a 3-qubit product state assignment is the projector onto the state 
\W) = ^(|001) + |010) + |100>) [233]. But here product states achieve value 4/9 [252], 
implying the bound of 1/4 from Theorem 2.2 is not tight. 

An upper bound on the true optimal ratio of 8k 2 / (2 k ) is implied by Theorem 2 of [120] 
for the case where d = 2 and k > 11. For general d and k, a simple construction shows 
that the optimal ratio is upper bounded by d~^i. To see this, consider a single clause 
which is the tensor product of maximally entangled bipartite states (for odd k, we assume 
the odd qudit out projects onto the identity). For example, for n — 4, consider the clause 
\<f) + )((f) + \ <g) \<p + )(<f) + \, where \<f) + ) = ^=(|00) + |11)). The maximum value a product state 
can attain is 1/4, as claimed. In the qubit setting (d = 2), one can further improve this 
construction for odd k by replacing the term (g> / on the last three qubits with 

|W)(W|. For example, for k — 5, setting our instance to be the clause ® 
yields an upper bound of (1/2) (4/9) = 2/9 < 1/4 = cf~W (where we again use the value 
4/9 for \W) from the previous paragraph). For general odd k > 1, this improved bound 
generalizes to 2^~ /9. 

2.3 Optimizing over the set of separable states 

Section 2.2 showed that there always exists a product state assignment achieving a certain 
non-trivial approximation ratio. In this section, we show how to efficiently find such a 
product state. Our main theorem of this section is the following (Theorem 2.10), from 
which Theorem 2.4 follows easily (see discussion at end of Section 2.3.3). As the proofs 
and full notation of this section are rather dense, we first discuss our results below using 
simplified notation and without proofs. Full proofs and technical details are deferred to 
Section 2.4. 

Theorem 2.10. Let H be an instance of MAX-k-local Hamiltonian acting on n qudits, 
and let OPTp denote the optimum value ofTr(Hp) over all product states p G V((C d )® n ) . 
Then, for any fixed e > 0, there exists a polynomial time (deterministic) algorithm which 
outputs pi ® ■ ■ ■ ® p n G V((C d )® n ) such that Tr(Fpi <g> • • • ® p n ) > OPT P - en k . 

We first outline our approach by generalizing the discussion in Section 2.1, introducing 
tools and notation we will require along the way. The optimal value OPTp over prod- 
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uct state assignments for any MAX-£;-local Hamiltonian instance can be expressed as the 
following program, denoted Pi: 

n 

OPTp = max r Ti(H h) „, )ik p h <g) • • • <g> p ik ) s.t. p { y and Tr(pj) = 1 for 1 < i < n. 



n,...,ik 



(2.15) 

As done in Equation (2.3), we now recursively decompose our objective function as a 
sequence of nested sums. Let {<Ti}f =1 be a Hermitian orthogonal basis for the set of Her- 
mitian operators acting on C d , such that Tr(<Ji<jj) = 25ij (for 5ij the Kroenecker delta). 
(See, e.g. [169], or Equations (6.5), (6.6), and (6.7) for an explicit construction of such 
basis elements. We remark that there is nothing special about the normalization factor of 
2 in the term 25^- above; this value is simply consistent with the specific basis construction 
we have chosen to employ, which generalizes the Pauli basis for a qubit system.) Then, by 
rewriting each i h in terms of {o"i}f =1 , our objective function becomes 
<p 



E Tr 



l k ,...,tl 



E 



r h,-,h a 3k 



® °h P^® • ' " ' ' ' ® Ph 



^2^j k Pi k ] 

ik ,3k 



E ™ 



l k-l 



(2.16) 



where each r 



H, •••;lfc 



Dd 2 



. We henceforth think of the objective function above as a "degree- 



k inner product" , i.e. as a sequence of k nested sums involving inner products, in analogy 
to the degree-k polynomials of Reference [28]. In this sense, a degree-1 inner product 
would refer to only the innermost sums over i\ and j\, and a degree- /c inner product would 
denote the entire expression in Equation (2.16). More formally, we denote a degree-6 inner 
product for 1 < b < k using map tb : 'H(C d ) xn t— > M, defined such that 



h(Pi, ■ ■ ■ , Pn) ■= ^2Tr(a jb p lbJ 



ib,3b 



(2.17) 



Note that t b implicitly depends on parameters ib+i, ■ ■ ■ ,ik and jb+i, ■ ■ ■ ,jk- (See the be- 
ginning of Section 2.4 for more elaborate notation used in the proofs of the claims of 
Section 2.3.) 

Our approach is to "linearize" the objective function of Pi using exhaustive sampling 
and recursion to estimate its degree-(fc — 1) inner products. To do so, we require the 
Sampling Lemma. 
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Lemma 2.11 (Sampling Lemma [ ]). Let (cij) be a sequence of n real numbers with 
\o>i\ < M for all i, and let f,g > 0. If we choose a multiset of s = g \ogn of the at random 

(with replacement) , then their sum q satisfies £V aj — nMy^ < q x ^ < ^\ a, + nM ^J~^ 

with probability at least 1 — n - '' . 

The proof of Lemma 2.11 follows from a simple application of the Hoffding bound [142]. 
To use the Sampling Lemma in conjunction with exhaustive sampling, we discretize the 
space of 1-qudit density operators using a S- net G C "H(C d ), such that for all p G V(C d ), 
there exists a G G such that || p — a || F < 5. We now show how to construct G. 

To obtain G, we instead construct a 5-net for a subset of "H(C d ) which contains V(C d ), 
namely the set ^l(C d ) := {A G %(C d ) | max^ |A(z,j)| < l}. (Note: A net over A(C d ) 
may allow non-positive assignments for a qudit. See Section 2.3.3 for why this is of no 
consequence.) Creating a 5-net over A(C d ) is simple: we cast a (<5/<i)-net over the unit 
disk for each of the complex d(d — 1)/2 matrix entries above the diagonal, and likewise over 
[—1,1] for the entries on the diagonal. Letting m and n denote the minimum number of 
points required to create such (5/<i)-nets for each of the diagonal and off-diagonal entries, 

respectively, we have that \G\ = m ( 2 ~n d . For example, simple nets of size m ~ d/5 
and n w d 2 /5 2 can be obtained by placing a ID and 2D grid over [—1, 1] and the length 
2 square in the complex plane centered at (0,0), respectively, implying \G\ G 0(1) when 
d G 0(1). To show that G is indeed a 5-net, we now bound the Frobenius distance between 
arbitrary p G V(C d ) and the closest p G G. (We use the Frobenius norm as it allows a 
simple analysis. Below, one could also consider the norm bound || A || < 5/d, where 
in this context || A = max^ \A(i, Specifically, let A := p — p. Then: 



Finally, we remark that our dense assumption on MAX-fc-local Hamiltonian instances 
is only necessary to convert the absolute error of Theorem 2.10 to a relative one (this 
conversion is detailed in Section 2.3.3). A dense assumption is not needed to apply the 
Sampling Lemma: Specifically, observe that Lemma 2.11 assumes there are n terms in 
the sum to be estimated, and that we are able to determine s of them. Looking back 
at Equation (2.1) and considering, say, qudit i, if we wish to use the Sampling Lemma 
to estimate the inner sum over neighbours N(i) of i, we might run into a problem if i 
does not have 0(n) neighbours. To circumvent this [28], observe that Lemma 2.11 only 
gives us an estimate to within ±en. Thus, if N(i) < en/ 10 (say), then we do not use 




(2.18) 
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the Sampling Lemma, but rather let our estimate be simply 0, which is guaranteed to fall 
within the desired error bounds (observe an estimate of does not necessarily work, on 
the other hand, if N(i) is large (say N(i) = n — 1), since typically f/g < 1). Throughout 
the remainder of our discussion, we assume this cutoff principle is implicitly present when 
employing Lemma 2.11. 

The remaining sections of this chapter are organized as follows: In Section 2.3.1, we 
show how to recursively estimate degree-6 inner products using the Sampling Lemma. We 
then use this estimation technique in Section 2.3.2 to linearize our optimization problem 
Pi. Section 2.3.3 brings everything together by presenting and analyzing the complete 
approximation algorithm. All technical proofs are found in Section 2.4. 

2.3.1 Estimating degree-6 inner products via sampling 

Our recursive procedure, EVAL, for estimating a degree-6 inner product using the Sampling 
Lemma is stated as Algorithm 2.12. There are two sources of error we must analyze: the 
Sampling Lemma, and our 5-net over C d . We claim that EVAL estimates the degree-6 
inner product . . . ,p n ) to within additive error ±€bn b , where e& is defined as follows. 
Set A := \^2d(l + 5), for 5 from our 5-net. Then, 



The following lemma formalizes this claim. We adopt the convention of [28] and let x G y±z 
denote x G [y — z, y + z] . Algorithm 2.12 is our operator analogue of the algorithm Eval 
in Section 3.3 of [28]. 

Lemma 2.13. Let t k : H{C k ) xn H> E be defined using set {# n ,...,; fe } C H((C d )® fc ) (as 
in Equation (2.16)). Let S C {1, . . . ,n} such that \S\ = g\ogn have its elements chosen 
uniformly at random with replacement. Let pi,...,p n G V(C d ) be some assignment on 
all n qudits, and {pi : i G S} a set of elements in our 5-net such that \\ pi — pi || F < 5 
for all i G S. Then, for 1 < b < k, with probability at least 1 — d 2b n h ~^ , we have 
EVAL(tb, S, {pi : i G S}) G h(pi, ■ ■ ■ ,p n ) ± where eb is defined as in Equation (2.19). 

2.3.2 Linearizing our optimization problem 

Our procedure, LINEARIZE, for "linearizing" the objective function of P\ using EVAL 
from Section 2.3.1 is stated as Algorithm 2.14. Algorithm 2.14 takes as input P\ and 




(2.19) 
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Algorithm 2.12. EVAL( t b , S , {pi : i G S} ). 

• Input: (1) A degree-6 inner product tb : T-L(C d ) xn K for 1 < b < k 

(2) A subset S C {1, . . . , n} of size |5| = O(logn) 

(3) Sample points {pi : z G 5} such that || p~j — pi || F < 5 for all i G 5 

• Output: x G K such that x G £&(pi, . . . , p n ) ± (for e& defined in Equation (2.19)). 

1. (Base Case) If b = 1, return ^ X^esT 1 - ((S^ 1= i r Ti,..tJk a ii) P^J- (^°^ e ^ n i s return 
value depends on i 2 , . . . , i^, j'2, • • • , jk, which are assumed to have a fixed value in the 
current recursive call to EVAL.) 

2. (Recurse) For alH G S and j = 1 . . . d 2 , set = EVAL(^ 3 _ 1 , 5, {p^ : z G S 1 }), where 

is the term to the right of Ti^o^pjJ in Equation (2.17). 

3. Return ^ 5] ie5 ^. =1 'ri'(rr ; p,) ( , ; 



a set of sample points {pi}, and outputs a semidefinite program (SDP) which we shall 
henceforth refer to as Pi- We remark that LINEARIZE is our version of the procedure 
Linearize in Section 3.4 of [28], extended to the setting of operators and a more complex 
error structure. Although LINEARIZE is presented as linearizing an objective function 
here, the same techniques straightforwardly apply in linearizing constraints involving high- 
degree inner products. 

We remark that the linear constraints output on each recursive call on line 3(b) of 
Algorithm 2.14 ensure the approximate consistency with our estimates from EVAL for any 
solution to P2, as well as play a crucial role in bounding how good of an approximation P2 
yields to P\. 

To prove correctness of our final approximation algorithm, we require the following two 
important lemmas regarding P 2 . The first shows that any feasible solution (pi,...,p n ) 
for Pi consistent with the sample set {pi : i G S} fed into LINEARIZE is also a feasible 
solution for P 2 with high probability. 

Lemma 2.15. Let t k , assignment (p 1; . . . ,p n ), S, and {pi : i G S} be defined as in Lemma 
2.13. Then, for any f,g > 0, calling LINEARIZE with parameters t^, {pi : i G S}, and 
e = e fc (f or e fc defined in Equation (2.19)) yields an SDP P 2 for which the assignment 
{pi, . . . , p„} is feasible with probability at least 1 — d 2k n k ~f . 
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Algorithm 2.14. LINEARIZE( t b , M , S, {pi : i G S}, e, U, L ). 

• Input: (1) A degree-fe inner product U : 'H(C d ) xn i->- R for 1 < 6 < A;. 

(2) A set of linear constraints M (e.g. "pi >: 0"). 

(3) A subset S C {1, . . . , n} of size |5| = O(logn). 

(4) Sample points {pj : i G 5} consistent with some feasible solution 
(pi, . . . , p„) for Pi such that || pi — pi || F < 5 for all i E S. 

(5) An error parameter e > 0. 

(6) (Optional) upper and lower bounds U, L G R. If C/ and L are not provided, 
we assume U,L — oo. 

• Output: (1) (Optional) A linear objective function / : (C(C d )) xn -> R. 

(2) An updated set of linear constraints, M . 

1. (Base case) If 6 = 1, then 

(a) (Trivial: Initial objective function was linear) If U = L = oo, return [tb, Af\. 

(b) (Reached bottom of recursion) Else, return [A^U {"L < £&(pi, . . . ,p n ) < f/"}]. 

2. (Recursive case) For % = 1 . . . n and j = 1 . . . <i 2 do 

(a) Set := EVAL^Id 5, {pi ■ % G 5}). 

(b) Set e' := e - dt (\/| + 5 ) A6_1 > for A defined in Equation (2.19). 

(c) Set ijj := ejj — e'n fc_1 and Uy := + e'n 6-1 . 

(d) Call LINEARIZE(4i 1 , AT, S, {p^ : % G S 1 }, e', Wy, Zy). 

3. (a) (Entire computation done) If U = L = oo, return Ti (a jPi)eij,Af 
(b) (Recursive call done) Else, return 

A" U | "L - e'd 2 n b < Tr(<r jp^eij <U + e'd 2 n 6 " }] . 



The second lemma is a bound on how far the optimal solution of P2 is from the optimal 
solution for Pi. We adopt the convention of [28] and write [x,y] ± z to denote interval 
[x-z,y + z\. 

Lemma 2.16. Let OPTp be the optimal value for P\, obtained by assignment p OPTp : = 
(p° pt , . • • ,Pn pt )- Let assignment {pj}™ =1 = {p° pt } i=1 , S, and {pi : i G S} be defined as 
in Lemma 2.13. Let P 2 denote the SDP obtained by calling LINEARIZE with S, and 
denote by e m for 1 < m < k the error parameter passed with map t m into a (possibly 
recursive) call to LINEARIZE. Then, letting OPT 2 denote the optimal value of P2, we 
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Algorithm 2.17. APPROXIMATE( H , e ). 



• Input: (1) A fc-local Hamiltonian H = J2i lr ..,i k H h,.,i k for each H h,-,i k e H((C d ) 0fc ). 

(2) An error parameter e > 0. 

• Output: A product assignment p\ <g> ■ ■ ■ £g> p n that with probability at least 1/2, has 

value at least OPTp — en k , for OPTp the optimal value for H over all 
product state assignments. 

1. Set e s d p := e/10. 

2. Define h : R — > K such that for any error parameter e input to LINEARIZE, h(e)n h 
is the absolute value of the bound on additive error given by Lemma 2.16. Then, 
define e' implicitly so that h(e') + e s d p = e holds. 

3. Define constant / such that 1 - d 2k n k ~ f > 1/2. 

4. Define constants g and S implicitly so that e' = (ysj^ + $j (^t)> ^ or ^ defined 
in Equation (2.19). 

5. Choose glogn indices S C {1, . . . ,n} independently and uniformly at random. 

6. For each possible assignment i from our <5-net to the qudits in S: 

(a) Call LINEARIZE^, {P^s constraints}, S,i,e') to obtain SDP P*. 

(b) Let oil denote the value of P\ obtained by substituting in the optimal solution 
ofP 2 \ 

7. Return the assignment corresponding to the maximum over all a,. 



have with probability at least 1 — d 2k n k ' (for parameters set as in Lemma 2.15) that 
OPT 2 G OPTp ± did + s/2) [Et=i(v / 2rf) fe - 1 - m e J n k 



2.3.3 The final algorithm 



We finally present our approximation algorithm, APPROXIMATE (Algorithm 2.17), in its 
entirety, which exploits our ability to linearize Pi using LINEARIZE (Algorithm 2.14). This 
proves Theorem 2.10, which in turn implies Theorem 2.4. We first clarify a few points about 
APPROXIMATE, then analyze its runtime, and follow with further discussion, including 
the algorithm's derandomization and a proof that dense MAX-fc-local Hamiltonian remains 
QMA-hard. 



77 



We begin by explaining the rationale behind the constants in Algorithm 2.17. The 
constant e s d p is the additive error incurred when solving an SDP [121]. We choose e' so 
that after running LINEARIZE and solving P%, the total additive error is at most e, as 
desired. We choose / to ensure the probability of success is at least 1/2. Finally, we set g 
large enough and 5 (for our 5-net) small enough to ensure that e' matches the error bounds 
for EVAL in Lemma 2.13. 

We now analyze the runtime of Algorithm 2.17. Let |G| denote the size of our <5-net 
G for a qudit. Then, for each of the |G| 9logn iterations of line 6, we first take 0(n k ~ l ) 
time to run LINEARIZE, outputting 0{n k ~ x ) new linear constraints (seen via a simple 
inductive argument). We then solve SDP which can be done in time polynomial in 
n and log(l/e s d p ) using the ellipsoid method [121] (see, e.g., [249]). Let r(n,£ s d p ) denote 
the maximum runtime required to solve any of the P\. Then, the overall runtime for 
Algorithm 2.17 is O(n glog ^(n k ~ 1 + r(n, e S dp))) ? which is polynomial in n for e,d,k G 
0(1) (recall from Section 2.3 that \G\ E 0((j) d ), and that 5 and g are constant in our 
setting). Note that, due to the implicit dependence of g on e, this runtime scales at least 
exponentially with varying e. 

Before moving to further discussion, we make two remarks. First, one can efficiently 
convert the output of Algorithm 2.17 to a pure state with the same guarantee by adapting 
the standard classical method of conditional expectations [ ] . To demonstrate, suppose 
{pi} is output by Algorithm 2.17. Then, set p[ to be the eigenvector \ipj)(ipj\ of p\ for 
which the assignment \ ® P2 ® • • • ® p n performs best for P x . (If the spectrum of pi is 

degenerate, begin by fixing an arbitrary choice of spectral decomposition for pj.) Let our 
new assignment be p\ <g> p2 <8> • • • ® p n - Now repeat for each p; for 2 < % < n. The final state 
p'x <8> • • • ® p' n is pure, and by convexity is guaranteed to perform as well as pi <8> • • • <8> p n . 

Second, recall from Section 2.3 that we constructed a 8-net over a space larger than 
V(C d ), allowing possibly non-positive assignments for a qudit. We now see that this is of 
no consequence, since regardless of which samples (positive or not) we use to derive our 
estimates with the Sampling Lemma, any feasible solution to P\ in Algorithm 2.17 is a 
valid assignment for P\. Moreover, we know that for each optimal pi for Pi, there must 
be some operator (positive or not) within distance 5 in our net, ensuring our estimates 
obtained using the Sampling Lemma are within our error bounds. 

Converting the absolute error of Algorithm 2.17 into relative error. To convert 
the absolute error ±en k of Algorithm 2.17 into a relative error of 1 — e' for any e', define 
constant c such that cn k is the value obtained for a MAX-fc-local Hamiltonian instance 
by choosing the maximally mixed assignment I/d n (analogous to a classical random as- 
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signment). Since I/d n can be written as a mixture of computational basis states, we have 
OPTp > cn k . It follows that by setting e = ce', Algorithm 2.17 returns an assignment with 
value at least OPT P - ce'n k > OPT P - e'OPT P > OPT P (l - e'), as desired. 

Derandomizing Algorithm 2.17. The source of randomness in our algorithm is Lemma 
2.11. By a standard argument in [28] (see also [42, 41]), this randomness can be eliminated 
with only polynomial overhead. Specifically, we replace the random selection of glogn 
indices in the Sampling Lemma with the set of indices encountered on a random walk of 
length O (glogn) along a constant degree expander [113]. Since the expander has constant 
degree, we can efficiently deterministically iterate through all such walks, and since 
such a walk works with probability l/n 0< ^\ at least one walk will work for all poly(n) 
sampling experiments we wish to run. 

QMA-hardness of dense MAX-fc-local Hamiltonian. It is easy to see that (exact) 
MAX-2-local Hamiltonian remains QMA-hard for dense instances (a similar statement 
holds for MAX-2-SAT [28]). For any MAX-2-local Hamiltonian instance with optimal value 
OPT, we simply add n qudits, between any two of which we place the constraint 1 00) (00 1 
(no constraints are necessary between old and new qudits). Then, the new Hamiltonian 
has optimal value OPT + (™), making it dense, and the ability to solve this new instance 
implies the ability to solve the original one. The argument extends straightforwardly to 
MAX-£;-local Hamiltonian for k > 2. 



2.4 Further technical details and proofs 

We now prove our claims in Section 2.3. For this, we first require expanding on the notation 
we have set thus far. 

Expanded Notation. We now expand on our previous notation for analyzing Equa- 
tion (2.16) in order to facilitate proofs of the claims in Section 2.3. First, to recursively 
analyze a clause H h ^ ik C H({C d ) m ), let H b G n((C d )® b ) for any 1 < b < k denote the 
action of i?^,...,^ restricted to the first b of its k target qudits, i.e. 




(2.20) 



jb,-,h=l 
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For example, H\ = Ylj 1 =i r fC'.tJk tT ^ anc ^ ^ k = ^h,-,h- Note that H b implicitly depends on 
variables i±, . . . , ik,jb+i, ■ ■ ■ ,jk- To reduce clutter, however, our notation does not explicitly 
denote this dependence unless necessary. Next, to recursively analyze a degree-a inner 
product, we define t a ^ : H(C d ) xn h-> K. for any < a < k and 1 < b < k such that 

n 

t a , b ( Pl ,..., Pn ):= ^{H l b ^' ,lk P^®---®Pn) (2-21) 

i a ,.. .,11=1 

(where setting a = eliminates the sum over indices i). For example, tk,k is our full 
"degree-fc" objective function in Equation (2.15), and more generally, tb,b is the degree-b 
inner product in Equation (2.16). Allowing different values for a and b greatly eases our 
technical analysis. We use the shorthand t b to denote t bj b, and again only explicitly denote 
the dependence of t ab on parameters i a +i, ■ ■ ■ ,ik and jb+i, ■ ■ ■ ,jk when necessary. 

We now state and prove a technical lemma required for the remainder of our proofs 
here. 

Lemma 2.18. Let {pi} n l=1 C H{C d ). For {F ilr ..,iJ C H(C dk ) any MAX-k-local Hamilto- 
nian instance with decomposition for the H ix ^ ^ ih as given in Equation (2.16), we have for 
anyO<a<k and 1 < b < k that \t aib (p u . . . ,p n )\ < (max^...^ || p ib || F • • • || p h || F ) dhn a . 

Proof. By the triangle inequality and the Holder inequality for Schatten p-norms (see 
Section 1.3), we have 



l a,b\ 



Tz{Hbp ib ®---®p h ) 

,...,U = 1 



< II H b Hp II Ph ® • • • ® Ph ll F (2-22) 

ia,--;h=l 

< ( max \\p ib || p --- || || F J W H b 



l b Hf ) 



where we have used the fact that || A ® B || F = || A || F || B || F for all A,B G £(C d ). If we 
can now show that || H b \\ F < || || F for all 1 < b < k, then we would be done since we 
would have i II Hb Hp — II B-k Hf nCL < <i^n a , where || H k || F < c?^ since || f/fc < 1 by 

definition. Indeed, we claim that for any fixed 1 < b < k, we have || H b || F < 2 V" || B\ k || p . 
To see this, note by straightforward expansion of the Frobenius norm and the fact that 
Tr (oiOj) = 25ij that 



H b \\ F = JTr(H, 



r2\ 




II T h, 


a ll b-k 




| r* 1 " 


■,ik } 


- 1 * || 2 = 2^ 


(25 










\\2 



3b, ■■■Jk 



(2.23) 



SO 



where r J1,, " ,lfc is the coordinate vector of Hi lr __^ k from Equation (2.16). Note, however, then 

fen ■ ■ 

for b = k, the inequality in the chain above is an equality, and so || || F = 22 || r ll, --' tk \\ 2 . 
Substituting this into the chain above completes the proof of our claim. □ 

We now prove our claims of Section 2.3. 

Proof of Lemma 2.13. We first derive the error bound of e^, and subsequently prove 
the probability bound. We follow [28], and proceed by induction on 6. For the base case 
6 = 1, EVAL(if!, S, {pi : i G S}) attempts to estimate 



ti(p U . . . , p n ) = 



E 



.a 



(2.24) 



using our flawed sample points {pi : i G S}. To analyze the error of its output, assume 
first that our sample points are exact, i.e. pi = pi for all i <E S. Then, by setting u a" in 
Lemma 2.11 to t^i f° r i — H, and by using Lemma 2.18 with parameters a = and 6 = 1 
to obtain upper bound M = , we have by the Sampling Lemma that (with probability 
at least 1 — n~f) 



n 

W\ 



E 



.a 



G £i(pi, . . . ,p n ) ±d^j -n. 

9 



(2.25) 



(Recall that the notation x G y ± z means here x G [y — z,y + z\.) This bound holds if we 
sum over exact sample points. If we instead sum over flawed sample points {pi : i G S} , 
the additional error is bounded by ^ times 



E 



.a 



< 



< 



JL^-l^n^i ~Ph)) 



E 

hes 



heS 



(2.26) 
(2.27) 
(2.28) 



< d 2 6n, 

where the second inequality uses Lemma 2.18 with parameters a = and 6=1 and the 
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promise of our 5-net. We conclude for the base case that, as desired, 



EVAL(Fi, S, {Pi-.ieS}) 



n 

W\ 



E 



.11 



G ti(pi, . . . ,/>„) ± da ( J- + 5 | n. 



(2.29) 
(2.30) 



Assume now that the inductive hypothesis holds for 1 < m < b — 1 . We prove the claim 
for m = b. To do so, suppose first that the recursive calls on line 1(b) of Algorithm 2.12 
return the exact values of t^_ 1 (pi, . . . ,p n ), and that we have exact samples {pi : i G S}. 



Then, since by calling Lemma 2.18 with a = b — 1 we have 



< 



dsn 6 1 , it follows by the Sampling Lemma that 



n 

W\ 



E 



ies L j 



^2^( y a J p i )t l b J _ 1 (p 1 ,...,p n ) 



E 



i=i 



-dKl-n b . 



(2.31) 

To first adjust for using flawed samples, observe that an analogous calculation to Equa- 
tion (2.28) yields |j J2ies ^(vjiPi ~ Pi)) — d^5n , where we have called Lemma 2.18 
with a = b — 1. Thus, using flawed samples, the output of Algorithm 2.12 satisfies 



n 

W\ 



E 

ies 



n 

^E 

i=i 



E ^(wK-i 



±S [ xl f - + 5\n b . (2.32) 



To next drop the assumption that our estimates ey on line 1(b) are exact, apply the 
induction hypothesis to conclude that G 4' 7 _ 1 (pi, . . . , p n ) ± ef,_in 6-1 . Then, 



n 

W\ 



E 



J^Tr (a 



C 



C 



n 
LSI 



n 

W\ 



n 

LSI 



E 
E 
E 



^Tr^p^Ci 



± 



|5| 



E 

i&S 



■ d 2 

^Ti (a j Pi) 



ies L i 



E Tr ^^)^-i 



± e 6 _i\/2d(l + 5)n 6 , (2.33) 
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where the last statement follows since 



d 2 

3=1 




d 2 

^E 

m=l 



<2d\\r || 2 < y/2d(l + 5), 



(2.34) 

where f denotes the coordinate vector of p, with respect to basis {cr m }, and we have used 
the facts that Tr(o"j<Tj) = 25^, that || x \\ l < yd || x || 2 for x G C d , that || pi || F = \/2 || f || 2 



for any pi G "H(C ), and that || Pi || F < 1 + 5 (which follows from our £-net and the triangle 
inequality). Thus, recalling that A = \/2d(l + S) and substituting Equation (2.32) into 
Equation (2.33), we have that 



n 

W\ 



E 



ieS L j 



G t b (p u ...,p n )± 




et-iA 



(2.35) 



We hence have the recurrence relation e& < da (y\J^ + S\ + e^A, which when unrolled 
yields 




e 6 < 

a / -m=0 \ V 3 

as desired. This concludes the proof of the error bound. 



(2.36) 



To prove the probability bound, we show a stronger bound of 1 — (^ m=0 d 



b 1 Mm^m 



n )n 



-f 



by 

induction on b. The base case 6 = 1 follows directly from our application of the Sampling 
Lemma in Equation (2.25). For the inductive step, define for brevity of notation 7 := d 2 n, 
and apply the induction hypothesis to line 1(b) of Algorithm 2.12 to conclude that each 
of the 7 calls to EVAL fails will probability at most (Ylm=o l m ) n ~^ ■ Then, by the union 
bound, the probability that at least one call fails is at most (Ylm=i l m ) n ~^ • Similarly, since 
our application of the Sampling Lemma in line 2 of Algorithm 2.12 fails with probability at 

most n~f, we arrive at our claimed stronger bound of 1 — ( Y^m=o1 m ) ■, as desired. □ 



Proof of Lemma 2.15. We begin by observing that if one sets e = e^, then the value of 
e' in line 2(b) of Algorithm 2.14 is precisely e^-i, and more generally, the e passed into the 
recursive call of line 2(e) on for any 1 < b < k is e&. Now, focus on some recursive call on 
tb for b > 1 (the case of b = 1 is straightforward by Lemma 2.13). If the estimates ey in line 
2(a) succeed, then by Lemma 2.13, we know that e^- G t^-^px, ■ ■ ■ , p n )±€b-in b ~ 1 , implying 
£{f_i(pi, • • • , Pn) £ [hj,Uij]- Now, and Uij are only incorporated into linear constraints in 
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recursive calls on t b _ 1 , yielding constraints of the form 



2„6-l 



k bjb - e b . 2 d 2 n b 1 < Ttfat-iPib-ifa-dt-i ^ u %ib + ^b-2d 2 n 

But {pi, . . . , p n } must now satisfy this constraint, since recall 

*6_l(pi, . . . ,p„) = 1^(0-i i -iPt6-l)C-2 1Jt_l (Pl» • • • >Pn)> 



(2.37) 



(2.38) 



and there are d 2 n terms e% b _^ b _ x in Equation (2.37) each yielding an additional error of at 
most e&_2n (assuming EVAL succeeded on t h h Z^ h ~ Y in line 2(a)) above and beyond the 
bounds t^_ 1 (pi, . . . , p n ) G [kj, Uij] we established above. 

We conclude that if, for all b, i, and j, EVAL succeeds in producing estimates ef , then 
{pi, . . . , p n } is a feasible solution for P 2 , as desired. The probability of this happening 
is, by the proof of Lemma 2.13, at least 1 — d 2k n k ~f , since EVAL recursively estimates 
precisely the same terms during its execution 1 . □ 

Proof of Lemma 2. 1 6. We begin by proving that for any recursive call to LINEARIZE 
on tb with valid upper and lower bounds U and L (i.e. U,L ^ 00), respectively, we have 
for any feasible solution (p 1; . . . , p„) to P 2 that 



t b { Pl ,...,p n ) G [L, U] ± d(d + V2) 



b-l 



^(V2d) 6 - 1 - m e r 



m=l 



(2.39) 



We prove this by induction on b, following [ ]. For base case b = 1, the claim is trivial 
by line 1(b) of the algorithm. Now, assume by induction hypothesis that 



t^ipi, ...,p n ) e [hj,Uij] ± d(d+ V2 



b-2 



b-2-m. 



m=l 



n 



(2.40) 



By substituting the values of Uj and Uy from line 2(c), we have 



tl-i(pi, ■ ■ ■ , Pn) e ± ld(d+ y/2) 



b-2 



b-2-m. 



m=l 



b-l 



(2.41; 



1 This holds even though on line 1 of Algorithm 2.12, we only estimate d 2 \S\ of the terms (i.e. EVAL 
does not actually estimate all terms in the recursive decomposition of tk, as it does not need to) — this is 
because in our analysis of the probability bound for Algorithm 2.12, we actually produced a looser bound 
by assuming all n terms are estimated. 
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We conclude that 



t b (p u . . . , p n ) 



C 



d(d+V2 



y 

6-2 



+ 



6-2-m, 



m=l 



+ 66-1 



^Tr^p; 



(2.42) 
(2.43) 
6-1 (2.44) 



C 



. ij 

V2d {^d(d + V2 



+ 



6-2 



C [[L, U] ± e 6 _idV] + V2d d(d + V2 



^(v / 2d) 



i 



6-2-m, 



m=l 



+ e 6 _i n° 



6-2 



6-2-m, 



C [L, U] ± d(d + y/2) 



6-1 



m=l 



m=l 



(2.45) 



+ e 6 -i n° 



(2.46) 



where the third statement follows from a calculation similar to Equation (2.34), and the 
fourth statement from line 3(b) of Algorithm 2.14. This proves the claim of Equation (2.39). 

To complete the proof of Lemma 2.16, observe that by Lemma 2.15, the assignment p opt 
is feasible for P 2 with probability at least 1 — d 2k n k ~f . Thus, plugging p opt into each of the 
d 2 n linear constraints produced by the recursive calls to LINEARIZE on each t l k 3 _ x , we have 
by Equations (2.39) and (2.45) that (with probability 1 - d 2k n k ~ f ) for OPT P = t k (p opt ), 



tk{p 



opts 



(2.47) 



C 



^Tr^p^ey 



L i] 



C OPT 2 ±d(d + V2) 



± V2d f d(d + V2) 

'k-l 

Y,{^d) k ~ l - m e m 



k-2 



k—2—m , 



m=l 



■m=l 



n 



+ e fc _i n" 



(2.48) 



where the last statement follows since p opt is not necessarily the optimal solution to P%. □ 
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Chapter 3 



Hardness of approximation for 
quantum problems 

This chapter is based on [109]: 

S. Gharibian and J. Kempe. Hardness of approximation for quantum problems. In Proceed- 
ings of 39th International Colloquium on Automata, Languages and Programming, pages 
387-398, 2012, DOI: 10.1007/978-3-642-31594-7, © 2012 Springer, www.springerlink.com. 

The polynomial hierarchy plays a central role in classical complexity theory. In this chapter, 
we define a quantum generalization of the polynomial hierarchy, and initiate its study. We 
show that not only are there natural complete problems for the second level of this quantum 
hierarchy, but that these problems are in fact hard to approximate. Using these techniques, 
we also obtain hardness of approximation for the class QCMA. Our approach is based on 
the use of dispersers, and is inspired by the classical results of Umans regarding hardness 
of approximation for the second level of the classical polynomial hierarchy [235] . We close 
the chapter by showing that two variants of the local Hamiltonian problem with hybrid 
classical-quantum ground states are complete and hard to approximate for the second level 
of our quantum hierarchy, respectively. 

3.1 Introduction and results 

Over the last decades, the Polynomial Hierarchy (PH) [193], a natural generalization of 
the class NP, has been the focus of much study in classical computational complexity. Of 
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particular interest is the second level of PH, denoted E^. Here, we say a problem is in Ef 
if it has an efficient verifier with the property that for any YES instance x G {0, 1}™ of 
the problem, there exists a polynomial length proof y such that for all polynomial length 
proofs z, the verifier accepts x, y and z. Note that the alternation from an existential 
quantifier over y to a for-all quantifier over z is crucial here - keeping only the existential 
quantifier reduces us to NP. 

It turns out that introducing such alternating quantifiers makes E2 a powerful class 
believed to be beyond NP. For example, there exist natural and important problems known 
to be in E^ but not in NP. Such problems range from "does the optimal assignment to 
a 3SAT instance satisfy exactly k clauses?" to practically relevant problems related to 
circuit minimization, such as "given a boolean formula C in Disjunctive Normal Form 
(DNF), what is the smallest DNF formula C equivalent to C?" (see, e.g. [235]). The 
study of Eg has also led to a host of other fundamental theoretical results, such as the 
Karp-Lipton theorem, which states that NP <2 P/ po i y unless PH collapses to E^. E2 has 
even been used to prove that SAT cannot be solved simultaneously in linear time and 
logarithmic space [98, 99]. For these reasons, E| and more generally PH have occupied a 
central role in classical complexity theoretic research. 

Moving to the quantum setting, the study of quantum proof systems and a natural 
quantum generalization of NP, the class Quantum Merlin Arthur (QMA) [] ], has been a 
very active area of research over the last decade. Recall from Section 1.5.2 that a problem 
is in QMA if for any YES instance of the problem, there exists a polynomial size quantum 
proof convincing a quantum verifier of this fact with high probability. With the notion of 
quantum proofs in mind, we thus ask the natural question: Can a quantum generalization 
of be defined, and what types of problems might it contain and characterize? Perhaps 
surprisingly, to date there are almost no known results in this direction. 

Our results: In this chapter, we introduce a quantum generalization of E2, which we call 
cq-E 2 , and initiate its study. Our results include cq-E 2 -completeness and cq-E 2 -hardness 
of approximation for a number of new problems we define. Our techniques also yield hard- 
ness of approximation for the complexity class known as QCMA. We now describe these 
results in further detail. 



Hardness of approximation for cq-E2. To begin, we informally define cq-E2 (see 
Section 3.2 for formal definitions). 
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Definition 3.1 (cq-£2 (informal)). A problem U is in cq-X2 if there exists an efficient 
quantum verifier satisfying the following property for any input x G {0, l} n : 

• If x is a YES instance ofU, then there exists a classical proof y G {0, i} poly ( n ) such 
that for all quantum proofs \z) G B® poly<yn \ the verifier accepts x, y and \z) with high 
probability. 

• If x is a NO instance ofU, then for all classical proofs y G {0, i} poly (™) ; there exists 
a quantum proof \z) G B^ poly ^ such that the verifier rejects x, y and \z) with high 
probability. 

(Recall here that B := C 2 .) We believe this is a natural quantum generalization of Sf. 
Here, the prefix cq in cq-S 2 follows since the existential proof is classical, while the for- 
all proof is quantum. One can also consider variations of this scheme such as qq-S 2 , 
qc-E2, or CC-S2 (with a quantum verifier), defined analogously. In this chapter, however, 
our focus is on cq-S 2 , as it is the natural setting for the computational problems for 
which we wish to prove hardness of approximation. Note also that unlike for Ef, the 
definition of cq-S 2 is bounded error - this is due to the use of a quantum verifier for 
cq-£ 2 . This implies, for instance, that the quantum analogue of the classically non-trivial 
result BPP C S 2 [227, 177], i.e. BQP C cq-£ 2 , holds trivially. Finally, one can extend the 
definition of cq-£2 to an entire hierarchy of quantum classes analogous to PH by adding 
further levels of alternating quantifiers, attaining presumably different classes depending 
on whether the quantifier at any particular level runs over classical or quantum proofs. 

To next discuss hardness of approximation for cq-£ 2 , we recall two classical problems 
crucial to our work here. First, in the NP-complete problem SET COVER, one is given a 
set of subsets {Si} whose union covers a ground set U, and we are asked for the smallest 
number of the Si whose union still covers U. If, however, the Si are represented succinctly 
as the on-set 1 of a 3-DNF formula (pi, we obtain a more difficult problem known as SUC- 
CINCT SET COVER (SSC). SSC, along with a related problem IRREDUNDANT (IRR), 
are not just NP-hard, but are S^-complete (indeed, they are even S^-hard to approxi- 
mate [235]). SSC and IRR are defined as: 

Definition 3.2 (SUCCINCT SET COVER (SSC) [ ]). Given a set S = {0J of 3-DNF 

formulae such that \J ieS 4>i is a tautology, what is the size of the smallest S' C S such that 
Vies' & a tautology? 

-"^By on-set, we mean the set of assignments which cause 4>i to be true. 
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Definition 3.3 (IRREDUNDANT (IRR) [ ]). Given a DNF formula <p = tiVt 2 V - • -Vt n , 
what is the size of the smallest S C {tj}" =1 stzc/i that (ft = Vies^i ^ 

Our work introduces and studies quantum generalizations of SSC and IRR. In par- 
ticular, analogous to the classically important task of circuit minimization, the quantum 
generalizations we define are arguably natural and related to what one might call "Hamilto- 
nian minimization" - given a sum of Hermitian operators H = Y2i Hi, what is the smallest 
subset of terms {Hi} whose sum approximately preserves certain spectral properties of HI 
We hope that such questions may be useful to physicists in a lab who wish to simulate 
the simplest Hamiltonian possible while retaining the desired characteristics of a complex 
Hamiltonian involving many interactions. We remark that at a high level, the connection to 
cq-£ 2 for the task of Hamiltonian minimization is as follows: The classical existential proof 
encodes the subset of terms {Hi}, while the quantum for-all proof encodes complex unit 
vectors which achieve certain energies against H. The problem QUANTUM SUCCINCT 
SET COVER is now defined as follows. 

Definition 3.4. QUANTUM SUCCINCT SET COVER (QSSC) (informal) Given a set 
of local Hamiltonians {Hi} such that has smallest eigenvalue at least a, what is the 

size of the smallest subset S of the Hi such that ^2 H eS Hi has smallest eigenvalue at least 
a ? Any subset satisfying this property is called a cover. 

As defined in Section 1.5.4, a local Hamiltonian is a sum of Hermitian operators, each 
of which acts non-trivially on at most k e 0(1) qubits. Intuitively, the goal in QSSC is 
to cover the entire Hilbert space using as few interaction terms Hi as possible. Hence, 
we associate the notion of a "cover" with obtaining large eigenvalues, as opposed to small 
ones, making QSSC a direct quantum analogue of SSC. We remark that since SSC is a 
classical constraint satisfaction problem, we believe the language of quantum constraint 
satisfaction, i.e. Hamiltonian constraints, is a natural avenue for defining QSSC. Our first 
result concerns QSSC, and is as follows. 

Theorem 3.5. QSSC is cq-X 2 - complete, and moreover is cq-£ 2 -/iani to approximate within 
jyi-e j or a n e > ; where N is the encoding size of the QSSC instance. 

By hard to approximate, we mean that any problem in cq-£ 2 can be reduced to an instance 
of QSSC via a polynomial time mapping or Karp reduction such that the gap between the 
sizes of the optimal cover in the YES and NO cases scales as N l ~ e . In other words, it is 
cq-£ 2 -hard to determine whether the smallest cover size of an arbitrary instance of QSSC 
is at most g or at least g' for g'/g e ^(iV 1-6 ) (where g' > g). We next define the problem 
QUANTUM IRREDUNDANT (QIRR). 
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Definition 3.6. QUANTUM IRREDUNDANT (QIRR) (informal) Given a set of suc- 
cinctly described orthogonal projection operators {Hi} acting on N qubits, and {q > 0} C 
K 7 define H := '^2 i CiH i . Then, what is the size of the smallest subset S C {Hi} such that 
for H' = ^H.fzgCiHi, vectors achieving high and low energies against H continue to obtain 
high and low energies against H , respectively? 

Here, by a succinctly described projector, we mean a possibly non-local operator which 
is the tensor product of fc-local projectors for some k G This non-local structure 

naturally generalizes IRR, where the DNF formula is allowed to be non-local. Our next 
result is the following. 

Theorem 3.7. QIRR is cq-T l2 -hard to approximate within N^~ € for all e > 0, where N is 
the encoding size of the QIRR instance. 

Hardness of approximation for QCMA. The techniques from above can also be 
used to show hardness of approximation for QCMA. Here, the class QCMA [ ] is de- 
fined as cq-£ 2 with the second (quantum) proof omitted, and can hence be thought of as 
the first level of our "cg-hierarchy" . By defining the problem QUANTUM MONOTONE 
MINIMUM SATISFYING ASSIGNMENT (QMSA) (see Section 3.5), we show: 

Theorem 3.8. QMSA is QCMA- complete, and moreover is QCMA-hard to approximate 
within N 1 ^ 6 for all e > 0, where N is the encoding size of the QMSA instance. 

A canonical cq-£ 2 -complete problem. Our last results the canonical £ 2 -complete 
problem SjSAT and its generalization to the quantum setting. Specifically, given a boolean 
formula (f>, E,SAT asks whether: 

3xiVx 2 3x 3 ■ • ■ Vxj such that 0(xi, x 2 , x 3 , . . . , x,) = 1. (3.1) 

Here, we have assumed i is even; for odd i, the last quantifier is a 3. The terms Xj are 
vectors of boolean variables. For i = 2, one can define a natural quantum generalization of 
this problem, denoted cq-E2LH and defined in Section 3.6, using local Hamiltonians whose 
ground states are tensor products of a classical string and a quantum state. We show: 

Theorem 3.9. cq-S 2 LH is cq-E 2 - complete. 

Moreover, by defining an appropriate variant of cq-S 2 LH, denoted cq-S 2 LH-HW and also 
defined in Section 3.6, where the goal is to minimize the Hamming weight of the classical 
portion of the ground states mentioned above, we obtain the following result. 
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Theorem 3.10. cq-£ 2 LH-HW is cq-£ 2 -complete, and moreover is cq-Y,2-hard to approxi- 
mate within iV 1-6 for any e > 0, for N the encoding size of the cq-£ 2 LH-HW instance. 

Proof ideas: Our proofs are inspired by the classical work of Umans [235, 219], and are 
achieved in a few steps. First, we show a gap-introducing reduction from an arbitrary cq-£ 2 
problem to a problem we call QUANTUM MONOTONE MINIMUM WEIGHT WORD 
(QMW) using dispersers (see e.g., [ 28, 232]). We then show the following gap-preserving 
reductions, where <k denotes a mapping or Karp reduction: 

QMW < K QSSC < K QIRR . (3.2) 

This yields hardness ratios of N e for some e > 0. To obtain the stronger results claimed in 
Section 3.1, we finally apply the gap amplification of Umans [235] and improved disperser 
construction of Ta-Shma, Umans, and Zuckerman [232]. 

In the classical setting, Umans [235, 219] used dispersers to attain hardness of approx- 
imation results relative to £ 2 for the classical problems MMWW (the classical version of 
QMW), SSC and IRR. To extend his techniques to the quantum setting, the most involved 
aspects of our work are the gap-preserving reductions from QMW to QSSC to QIRR. Here, 
an intricate balancing act involving carefully defined local Hamiltonian terms is needed to 
construct operators with the spectral properties required for our reductions. To analyze 
the resulting sums of non-commuting Hamiltonians, we require heavier machinery, such 
as the specific structure of Kitaev's local Hamiltonian construction [ ], the Projection 
Lemma of Kempe, Kitaev, and Regev [163], and the Geometric Lemma of Kitaev [171]. 

Finally, to show cq-£ 2 -completeness of cq-£ 2 LH, we study the interplay between proofs 
of a classical-quantum structure and Kempe and Regev's [164] 3-local Hamiltonian con- 
struction. Specifically, a careful analysis reveals that any cq-£ 2 verification circuit can be 
modified in such a way that fixing the value c of its classical proof register leads to an 
effective Hamiltonian H c . We then study the spectrum of H c to achieve the desired result. 
Moving on to cq-X 2 LH-HW, hardness of approximation is now attained by combining our 
reduction for cq-£ 2 LH with the result that QMW is hard to approximate. 

Previous and related work: In terms of hardness of approximation, the related ques- 
tion of whether a quantum PCP theorem holds is currently one of the biggest open problems 
in quantum complexity theory (see, e.g., [6, 17, 26, 134]). Regarding quantum generaliza- 
tions of PH, the only previous work we are aware of is that of Yamakami [260]. However, 
the results of Yamakami are largely unrelated to ours (for example, complete problems are 
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not studied), and the proposed definition of Reference [260] differs from ours in a num- 
ber of ways: It is based on quantum Turing machines (whereas we work with quantum 
circuits), allows quantum inputs (whereas here, like QMA, the input to a problem is a clas- 
sical string), and considers quantum quantifiers at each level of the hierarchy (whereas in 
its full generality our scheme allows alternating between classical and quantum quantifiers 
between levels as desired). 

Significance and open questions: The classical polynomial hierarchy plays an impor- 
tant role in classical complexity theory, both as a generalization of NP and as a proof 
tool in itself. It is hoped that the scheme we propose here for generalizing PH to the 
quantum setting will find similar applications in quantum complexity theory. Second, the 
problems we show to be cq-E 2 -complete here are arguably natural, and in embodying a 
generalization of classical circuit minimization or optimization, may hopefully be related to 
practical scenarios in a lab. Further, although the alternation between classical and quan- 
tum quantifiers in cq-E 2 may a priori seem odd, the notion of relating a classical proof 
to, say, subsets of local Hamiltonian terms, and the quantum proof to quantum states 
achieving certain energies is in itself quite natural, and in our opinion justifies the study of 
such a combination of quantifiers. Third, with respect to hardness of approximation, since 
whether a quantum PCP theorem holds remains a challenging open question, it is all the 
more interesting that one is able to prove hardness of approximation in a quantum setting 
here using an entirely different tool, namely that of dispersers. We remark that dispersers 
and their two-sided analogues, extractors, have been used classically to amplify existing 
PCP inapproximability results [' 28, 263]. However, as far as we are aware, neither are 
known to directly yield PCP constructions. 

We leave a number of questions open: What other natural problems are complete for 
cq-£ 2 or higher levels? Can we say anything non-trivial about the relationship between 
Sf and cq-£ 2 ? How do the different classes cq-£ 2 , qc-S 2 , qq-S 2 , and cc-S 2 relate to 
each other? Where do the quantum hierarchies obtained by extending cq-S 2 to higher 
levels sit relative to known complexity classes? We hope the answers to such questions 
will help establish classes like cq-S 2 as fundamental concepts in the setting of quantum 
computational complexity. 

Organization of this chapter: We begin in Section 3.2 by formally defining the classes 
and problems studied in this chapter. In Section 3.3, we prove that QSSC and QIRR are 
hard to approximate for cq-S 2 within N e ; this is further improved in Section 3.4. Section 3.5 
presents hardness of approximation results for QCMA. We close in Section 3.6 by showing 
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cq-£2-completeness of cq-X^LH and cq-X2-hardness of approximation for cq-X^LH-HW. 

3.2 Definitions 

We now define relevant classes and problems, and state lemmas which prove useful in our 
analysis. Throughout our discussion, recall that B := C 2 , and for a set S of matrices over 
C,]etH 8 := E Hi esHi. 

We begin with a formal definition of cq-X2. Recall that a promise problem is a pair 

A = (A ycs , A ao ) such that A yes , A no C {0, 1}* and A yes n A no = 0. 

Definition 3.11 (cq-X^)- Let A = (A yes ,A no ) be a promise problem. We say that A G 
cq-^2 if there exist polynomially bounded functions t, c, q : N > N, and a deterministic 
Turing machine M acting as follows. For every n-bit input x, M outputs in time t(n) a 
description of a quantum circuit V x such that V x takes in a c(n)-bit proof \c) , a q(n)-qubit 
proof \q) , and outputs a single qubit. We say V x accepts \c)\q) if measuring its output qubit 
in the computational basis yields 1. Then: 

• Completeness: If x G A yes , then 3 |c) such thatW \q) , V x accepts \c)\q) with probability 

> 2/3. 

• Soundness: If x G A no , then V \c), 3 \q) such that V x rejects \c)\q) with probability 

> 2/3. 

Note that the completeness and soundness parameters can be amplified to values exponen- 
tially close to 1. Specifically, we use the standard approach of repeating V x polynomially 
many times in parallel (see "Error reduction for QMA" in Section 1.5.2), except that we 
only need one copy of the classical register C for all parallel runs. For any value c placed 
in C, we think of it as being "hardwired" into V x , thus obtaining a quantum verification 
circuit V X)C , which we now apply in parallel to the many copies of the quantum proof. The 
standard weak error reduction analysis for QMA now applies (see, e.g. [ ]). Throughout 
this chapter, we refer to this as error reduction. 

We next define the terms cQMA circuit, monotone set, QMW, QSSC, and QIRR. 

Definition 3.12 (cQMA circuit). Let n,m G N + . A cQMA circuit V is a quantum circuit 
receiving n bits in an INPUT register and m qubits in a CHOICE register, and outputting 
a single qubit \a). We say: 
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• V accepts x G {0, l} n in INPUT if for all \y) G B® m in CHOICE, measuring \a) in 
the computational basis yields 1 with probability at least 2/3. 

• V rejects x G {0, l} n in INPUT if there exists a \y) G B® m in CHOICE such that 
measuring \a) in the computational basis yields with probability at least 2/3. 

Definition 3.13 (Monotone set). A set S C {0, l} n is called monotone if for any x G S , 

any string obtained from x by flipping one or more zeroes in x to one is also in S . 

Definition 3.14 (QUANTUM MONOTONE MINIMUM WEIGHT WORD (QMW)). 

Given a cQMA circuit V accepting exactly a non-empty monotone set S C {0, l} n , and 
integer thresholds < g < g' < n, output: 

• YES if there exists an x G {0, l} n of Hamming weight at most g accepted by V . 

• NO if all x G {0, 1}™ of Hamming weight at most g' are rejected by V . 

Note that clearly QMW G cq-S 2 . 

Definition 3.15 (QUANTUM SUCCINCT SET COVER (QSSC)). Let S := {Hi} be a 
set of 5-local Hamiltonians Hi acting on N qubits such that Yln es^-i oil for a > 0. 
Then, given (3 G M such that a — f3 > 1 and integer thresholds < g < g' , output: 

• YES if there exists S' C S of cardinality at most g such that ^2 Hi£S , Hi y cel. 

• NO if for all S' C S of size at most g' , Y2h es' has an eigenvalue at most (3. 

Any S' satisfying the YES case is called a cover. 

Note that requiring a — (3 G fi(l) above is without loss of generality, as any instance 
of QSSC with gap l/p(N) for p a polynomially bounded function can be modified to 
obtain an equivalent instance with constant gap by multiplying each Hi by p(N) [248] (see 
Section 1.5.4). 

Definition 3.16 (QUANTUM IRREDUNDANT (QIRR)). Given S := {c t Hi}, where 
each Hi acts on N qubits and is a tensor product of 5-local orthogonal projection operators 
and Ci > are real. Then, given a, /3 G R such that a — (3 > 1, and integer thresholds 
< g < g' , output: 

• YES if there exists S' C S of cardinality at most g such that for all \ip) G B® N : 



95 



- If Tr (H s \ip){ip\) > a, then Tr(H S '\^)(^\) > a, and 

- I/Tr(# s |^|) < P, then Tr(H s ,\^\) < f3. 

• NO if for all S' C S of cardinality at most g' , there exists a state \ip) G B® N with 
Tr{H s \ip){ip\) > a and Tr{H s <\ip){i\) < (3. 

Roughly, QSSC asks how many local interaction terms in a local Hamiltonian one can 
discard while maintaining the value of the worst assignment. This is intended to mimic 
the idea of maintaining a tautology for a 3-DNF formula in SSC classically. Analogous to 
the relationship between SSC and IRR, QIRR allows possibly non-local Hamiltonian terms 
so long as they have a succinct description (this generalizes the use of superconstant arity in 
IRR) and are projectors up to scalar multiplication (this generalizes the requirement that 
each term tj in IRR is an AND of variables). QIRR then asks how many interaction terms 
can be discarded in a sum of such Hamiltonian terms while ensuring that any assignment 
l^) achieves approximately the same value on both the original and modified Hamiltonians. 

Next, the key tool enabling the creation of a gap in our reductions is a disperser (see 
e.g. [228, 232]). 

Definition 3.17 (Disperser). Let G = (L,R,E) be a bipartite graph with \L\ = 2 n , \R\ = 

2 m and left-degree 2 d . Then, G is called a (k, e)-disperser if, for any subset V C L of size 
\L'\ > 2 k , L' has at least (1 — e) \R\ neighbors in R. Moreover, if for any pair (v,i) for 
v G L, one can compute the ith neighbor of v in time polynomial in n, then the disperser 
is called explicit. 

Finally, in this chapter we use the following useful known facts from local Hamiltonian 
complexity theory. To begin, we have two lemmas used to bound the eigenvalues of a pair 
of non-commuting operators. The first of these is the Geometric Lemma of Kitaev, which 
we stated as Lemma 1.8 in Section 1.5.5. The second is the Projection Lemma, stated 
below. 

Lemma 3.18 (Kempe, Kitaev, Regev [163], Projection Lemma). Let Y = Y 1 + Y 2 act on 
Hilbert space % = S + S 1 - for Hamiltonians Y\ and Y 2 . Denote the zero eigenspace ofY 2 
as S, and assume the Y 2 eigenvectors in S 1 - have eigenvalue at least J > 2 || Y\ \\ 00 . Then, 
for \{Y) the smallest eigenvalue ofY and Y\$ '■— n^l^Hs, 

x{Ylls) " j-nYW - X{Y) - x{Ylls) ■ (3 - 3) 

II 1 II oo 
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We next briefly review the elements of Kitaev's circuit-to-Hamiltonian construction [171] 
which play an important role in this chapter (see in Section 1.5.5 for an in-depth treat- 
ment). Given a cq-S 2 verification circuit V = Vl ■ ■ ■ V\ (where without loss of generality, 
each Vi is a one- or two-qubit unitary) acting on n proof bits (register A), m proof qubits 
(register B), and p ancilla qubits (register C), recall that this construction outputs a 5- 
local Hamiltonian H acting onA®B®C®D, where D is a clock register consisting of L 
qubits. We then have H := if in + H OVLt + H piop + H staD , for penalty terms as defined below: 

H in := I AB ® [yy\l){l\ c }j ®|0><0| B (3.4) 
#out := I A ®\Q)$\ Bl ®Ic®\L)(L\ D (3.5) 

L 

Hp VO p := yjf/j, where Hj is defined as (3.6) 
-\Vi ® - Md - \v] ® \j - + \l® (U)O'I + \j - 1)0' - M)d (3.7) 

L-1 

^stab := ^,c®5>l)(01k A+i . (3-8) 

i=i 

Above, the notation refers to the ith qubit of register A (similarly for B, C, D). For 
any prospective proof in Tr(if each penalty term has the following effect on 
the structure of H in ensures that at time zero, the ancilla register is set to zero as it 
should be for V. H ont ensures that at time step L of V, measuring the output qubit causes 
acceptance with high probability. H prop forces all steps of V appear in superposition in 
\ip) with equal weights. Finally, note that for H in , H out , and H prop above, time t in clock 
register D is implicitly encoded in unary as |l*0 i_t ) (for H stSL b above, register D is already 
explicitly written in unary); H sta b is thus needed to prevent invalid encodings of time steps 
from appearing in D. 

We use two important properties of this construction. First, the null space of H in + 
H piop + -f^stab is the space of history states, which for arbitrary AB are defined as 



'hist 



sfL 



1 L 

= J2 ^ ■ ■ ■ WUb ® |0)o ® \i) D - (3-9) 



For cq-E 2 circuits V, it is convenient to define for c G {0, l} n and \q) G B® m the shorthand 
l c '?)hist := l' ? /')hist f° r 1^) = l c )k)- The second important property of H we use is that its 
spectrum is related to V as follows. 
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Lemma 3.19 (Kitaev [171]). The construction above maps V to (H,a,b) satisfying: 



• If there exists a proof accepted by V with probability at least 1 — e, then 
achieves Tr(H\i/;) (i[)\ hiBt ) < a for a := ej (L + 1). 



3.3 Hardness of approximation for cq-S 2 

We now show hardness of approximation for cq-E2 for the problems QMW, QSSC, and 
QIRR. We begin with a gap-introducing reduction from an arbitary problem in cq-E 2 to 
QMW. We remind the reader that the hardness ratios obtained here are further strength- 
ened in Section 3.4. 

Theorem 3.20. There exists a polynomial time reduction which, given an instance of an 
arbitrary cq-E 2 problem, outputs an instance of QMW with thresholds g and g' satisfying 
g' I g G Q(N € ) for some e > 0, where N is the encoding size of the QMW instance. 

Proof. The reduction follows Theorem 1 of Umans [235] closely; the points where we devi- 
ate from [235] are explicitly noted. Let IT be an instance of an arbitrary promise problem 
A = (A yes , A no ) in cq-E 2 with encoding size n, and whose verification circuit V has a 
c(n)-bit existential proof register and a g(n)-qubit for-all proof register. We wish to map 
II to a cQMA circuit W for QMW such that W accepts strings of small or large Hamming 
weight depending on whether II G A ycs or II G A no , respectively. To do so, we follow [235] 
and construct an explicit (k, l/2)-disperser G = (L,R,E) with left-degree 2 d using Refer- 
ence [228], where \L\ = 2 C (™) +1 , \R\ = 2 k+d -°W, and k := 7 log c(n) for 7 G 6(1) to be set 
as needed. Note that the value of d depends on the specific disperser construction used 
- for the construction of | ] , we have d = 4k + O (log n) . Roughly, the idea of Umans 
is now to have L correspond to assignments for the c(n)-bit classical register of V, and 
R to assignments for the classical register of W (in the setting of [235], note that W is a 
classical circuit). We then encode assignments from L by instead choosing neighbor sets in 
R. By exploiting the properties of dispersers, one can ensure that the sizes of the neighbor 
sets in R chosen vary widely between YES and NO cases for IT. 

Specifically, imagine the vertices in L are arranged into a complete binary tree whose 
2c(n) i eaves denote the 2 C ^ possible assignments to V's classical register. For convenience, 
we henceforth use L to mean this tree. Now, let x G {0, l} c(n ) denote a leaf of L. Then, a 



• IfV rejects all proofs then H y bl for b G Q 
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Figure 3.1: Here, the string 11110 • • • in R encodes the string 000 in L. (Note: This graph 
is not a disperser, but nevertheless illustrates the encoding scheme.) 

subset of vertices R! C R is said to encode x if it contains the union of the neighbor sets of 
all vertices in the unique path from the root of L to x. Figure 3.1 illustrates this encoding 
scheme. How do the vertices of R then relate to Wl Each vertex r G R corresponds to an 
input bit of W - setting this rth bit to one means we "choose" vertex r. 

With the encoding scheme defined, we now construct the cQMA circuit W. Given y 
and \z) to its INPUT and CHOICE registers, respectively, W acts as follows: (a) If y 
corresponds to a subset R y C R such that \R y \ > \R\ /2, then W sets its output qubit 
to one. (b) If \R y \ < \R\ /2, then W first decodes R y to obtain the set of leaves L y C L. 
Roughly, it then outputs one if there exists x G L y causing n's verification circuit V to 
output one when fed the proofs x and \z). These last two steps require further clarification, 
which we now provide. 

First, given R y C R 1 decoding it to obtain the set of leaves L y C L might a priori require 
exponential time, as recall \L\ = 2 c(n ) +1 . This, however, is precisely where dispersers play 
their part: Since we set e = 1/2 in constructing our disperser, we know that for any S C R 
with IS' | < \R\ /2, there are at most 2 k = c(n) 7 vertices in L whose neighbor sets are 
completely contained in S. Thus, by starting at the root of L and performing a breadth- 
first-search down the tree (where we prune any branches along which we encounter a vertex 
whose neighbor set is not contained in R y , as by definition such vertices cannot encode any 
leaf x), we can efficiently decode R y to obtain L y while visiting only polynomially vertices 
in L. It remains to specify how W checks whether there exists an x G L y causing V to 
accept, and here we must deviate from Umans' construction. 

First, if \L y \ = 1, our task is straightforward - simply run V as a black box on proofs 
x £ L y and \z), and output the result. Then, W outputs one with probability at least 
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2/3 on input y for all quantum proofs \z) if and only if V also does so on proofs x and 
\z). If , however, \L y \ > 1, a more involved construction of W is necessary. Here, W takes 
three inputs: a classical description of V, an |i?|-bit string y to denote subsets in R, and a 
2 fc g(n)-qubit proof \z). Then, for the zth candidate string Xj G L y , W feeds a:, and the zth 
block of q(n) proof qubits of \z) into V . (If |LJ < 2 k , we simply re-use values of x <E L y in 
the leftover parallel runs of V.) W then coherently computes the OR of the output qubits 
of all parallel runs of V and outputs this qubit as its answer. 

Let us briefly justify why this works. For simplicity, assume the quantum proof to W 
can be written \z) = \zi) <8> • • • <S> \ z 2 k); entangled proofs can be shown not to pose a problem 
via the same proof technique used in standard error reduction [ ]. Now, if there exists 
an Xi G L y causing V to accept for all quantum proofs, then in the zth parallel run of V 
in W corresponding to X{, V outputs 1 with probability at least 2/3 on any \zi), implying 
W outputs 1 with probability at least 2/3. Conversely, if for all Xi G L y , there exists a 
quantum proof \zi) rejected by V, then by standard error reduction for V and the union 
bound, the state \z) = \z\) ® ■ ■ ■ ® \z 2 k) causes W to output 1 with probability at most 
1/3, as required. 

Following Reference [235] again, we now argue that W accepts a non-empty monotone 
set, and we analyze the hardness gap introduced by this reduction. The first of these is 
simple - namely, W accepts a set R' C R if either \R\ > \R/2\, in which case it also accepts 
any R" D R', or if R! encodes some x G L accepted by V, in which case any R" D R' would 
also encode x and hence be accepted. As for the gap, if x G L is an accepting assignment 
for V when II G A yes , then to encode x using a subset of R requires at most c(n)2 d vertices 
in R, where recall 2 d is the left-degree of our disperser. On the other hand, if II G A no , 
then the only way for W to accept is to choose R' C R with \R'\ > \R\ /2 w c(n)~ i 2 d . This 
yields a hardness ratio of f2(c(n) 7-1 ). Since W's encoding size N is polynomial in c(n), 
there exists some e > such that the ratio produced is of order N e , as desired. □ 

We next show a gap-preserving reduction from QMW to QSSC Its proof requires 
Lemmas 3.22 and 3.23, which are stated and proven subsequently. 

Theorem 3.21. QSSC is in cq-S2. Further, there exists a polynomial time reduction 
which, given an instance of QMW with thresholds f and f , outputs an instance of QSSC 
with thresholds g = f + 2 and g' = f + 2, respectively. 

Proof. That QSSC is in cq-S 2 follows using Kitaev's verifier [171] for putting k- local Hamil- 
tonian in QMA. Specifically, we construct a cq-S2 verification circuit for QSSC which takes 
a description c of some subset of local Hamiltonians S := {Hi} in its classical register, and 
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estimates the energy achieved by \q) in its quantum register against Hs using Kitaev's 
approach, outputting zero or one according to whether the measured energy is above or 
below the desired thresholds. 

To reduce QMW to QSSC, suppose we are given a cQMA circuit V accepting exactly 
a non-empty monotone set T C {0, l} n and threshold parameters / and /'. We assume 
without loss of generality that V is represented as a sequence of one and two qubit unitary 
gates Vi such that V = Vl---V\. We also assume using standard error reduction that if 
V accepts (rejects) input x 6 {0,1}", then it outputs one (zero) with probability at least 
1 — e := 1 — 2~ 4(n+m ). 

We now state our instance (S,a, (3,g,g') of QSSC as follows. We first apply Kitaev's 
circuit-to-Hamiltonian construction from Section 3.2 to V to obtain a 3-tuple (H,a,b). 
Note that H = Y^i=i Hi with r terms ^ -< I. Then, set a := 1 — (( + l)e, and 
C := 2(1 + 2 2(n+m ^)/(L + 1). Define /? := 1 - b. Note that for large n + m, this yields 
a > 1 - 2-( n+m ) and (3 < 1 - c(l - 2- (n+m ^)/L 3 for some constant c. Further, define 
g :— f + 2, g' := /' + 2, and let S consist of the elements (intuition to follow) 

G x := (L + l)|0)<0| Al ® I B ,c®\0)(0\ D (3.10) 



G n := (L + l)|0)(0| An ®/ B , c ®|0)(0| D (3.11) 
:= (A + l)(H in + H pmp + H stah ) (3.12) 

G n+ 2 '■= I — (i^in + H prov + -ffgtab + H out ), (3.13) 

for A > to be chosen as required, and where Ai denotes the zth qubit of register A. 
Intuitively, the terms in S play the following roles: G n+ \ penalizes assignments which are 
not valid history states. G n+ 2 penalizes valid history states accepted by V. Finally, the Gi 
for i G [n] penalize valid history states rejected by V (recall that V accepts a monotone 
set, and so flipping a one to a zero in register A may lead V to reject). Thus, we cover the 
entire space. We now make this rigorous. 

As required by Definition 3.15, we begin by showing that S itself is a cover, i.e. that 
Gs ^ ®Ia,b,c,d- First, note that 

« 

G S = I + G * ~ + A (#- + #P'°P + F ^ab)- (3.14) 
i=l 

It thus suffices to prove that for large enough A, 

A(F in + H piop + # stab ) +[J2gA- H ont y -(C + l)eJ. (3.15) 
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To show this, we use Lemma 3.18, the Projection Lemma, with 

Intuitively, the Projection Lemma tells us that by increasing our weight A, we can force the 
smallest eigenvalue of Yi + Y 2 to be approximately the smallest eigenvalue of Y\ restricted to 
the null space of Y 2 . In our setting, this implies it suffices to study the smallest eigenvalue of 
Yi restricted to the space of all valid history states, i.e. states of the form of Equation (3.9). 
Let i5>hi St denote the space of valid history states; note <5>hist is the null space of H in + 
-ffprop + #stab- Then, in the notation of Lemma 3.18, to lower bound A(Y 1 |,5 h . st ), we invoke 
Lemma 3.23 to instead upper bound the largest eigenvalue of (— Yi)|s hist - This yields 
A(Yi|s hist ) > — (e. Noting that || Y\ < n(L + 1) + 1, and since by Lemma 3.22 the 
smallest non-zero eigenvalue of Y 2 scales as f2(A/L 3 ), it follows by Lemma 3.18 that by 
setting A e f2(n 2 L 5 /e), we have Y\ + Y 2 >z — (C + l) 6 -^ as desired. This completes the proof 
that S is a cover. 

We now show the desired reduction. Assume first that V accepts a string x of Hamming 
weight k, and let T C [n] be such that i G T if and only if x« = 1. We claim there exists 
a cover 5" C S of size \S'\ = k + 2 which consists of G n +i, G n+2 , and the k terms Gi such 
that i G T. To show this, following the proof above, the analogue of Equation (3.15) which 
we must prove is 

A(H- m + # pr o P + tfstab) + f |] Gi \ - H out h -(C + l)e/. (3.17) 
First, applying Lemma 3.23 again, we lower bound the smallest eigenvalue of 




(3.18) 



restricted to 5hi S t by —(e. Since || Y[ \ OQ < || Yi for Y\ from the previous case of T = [n], 
the value of A from before still suffices to apply Lemma 3.18 and conclude that Equa- 
tion (3.17) holds, as desired. 

Conversely, suppose V rejects any string x of Hamming weight at most k. For any 
S'C5 with |S"| < k + 2, we claim that G$' has an eigenvalue at most f3. To see this, note 
first that if G n+2 S', then the state |l n ,2/) hist attains expected value zero against Gs', 
where note (3 > 0. Similarly, if G n+ \ S', then the state \l n ) A BC ® |0)d obtains expected 
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value at most zero against G. We conclude that in order to refute the claim that G has 
an eigenvalue at most 0, we must have G n+ i,G n+ 2 £ S'. This implies that S' contains at 
most k terms Gi for % G [n]. Then, consider the string x which has ones precisely at these 
at most k positions i G [n] corresponding to Gi G S'. It follows that the state \x,y) hist lies 
in the null space of all terms in S' with the possible exception of G n+ 2- Moreover, since V 
rejects all strings of Hamming weight at most k, there exists by the definition of a cQMA 
circuit and Lemma 3.19 a \y) G B® m such that 

Tr (G n+2 \x, y) (x, y| hist ) = 1 - Tr (H\x, y) (x, y\ hist ) < 1 - b = 0, (3.19) 

completing the proof. □ 

The following two lemmas are required for the proof of Theorem 3.21. Their statements 
and proofs assume the notation of Theorem 3.21. 

Lemma 3.22. The smallest non-zero eigenvalue ofY 2 = A(H in + H prop + H stah ) scales as 
tt(A/L 3 ). 

Proof. We bound the smallest non-zero eigenvalue of H in + H prop ; it is straightforward to 
show using the approach of Reference [ I 71] that the addition of i? sta b does not affect this 
lower bound (see Section 1.5.5). Our proof idea here is to "lift" the null space of H m + H wov> 
so that the smallest non-zero eigenvalue of H in + H pTop becomes the smallest eigenvalue of 
the lifted operator, and then apply the Geometric Lemma (Lemma 1.8) to lower bound 
the latter. 



To begin, recall that the null space of H in + H pmp consists of all valid history states 

1 L 

= • • • Vx\if>) AiB ® |0) c ® \i) D , (3.20) 



i=0 



for any \i[)) AB - (Since we omit H stah for now, we assume here that the clock register is 
represented in binary, i.e. there are no invalid clock states.) As done in Reference [171] 
and Section 1.5.5, our analysis is simplified by first applying the unitary change of basis 
W = Y,UVI--V]®\j){jl yielding 

W\^} Mst = |V>. B ® |0} c ® h) D (3.21) 



WH in W^ = H in = I A>B ® I 2J|1><1| C 1 ®|0)(0| o (3.22) 
WH prop W^ = I AB ®I C ®E D (3.23) 
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where (7) := ( J2i=o l*)J> anc ^ f° r some operator E D whose eigenvalues are given by 
Afc = 1 — cos(irk/(L + 1)) for < k < L and whose unique zero-eigenvector is I7). 

As alluded to above, we now lift the null space of W{H m + H prop )W^ . Letting n nist 
denote the projector onto the space of valid history states \4>) hisV this is accomplished by 
defining 

A 1 := W(H in + pU hist )W^ (3.24) 
A 2 := W(H piop + 2U hist )Wl (3.25) 

Note that [H in ,IL Ust ] = [H pTop , IL Ust ) = 0, H^mlL < V and || F prop < 2. It thus 
remains to lower bound the smallest eigenvalue of A\ + A 2 , for which we apply Lemma 1.8 
(Geometric Lemma) to A\ + A 2 via the approach of Reference [171]. For this, we require 
values for the parameters v and a(C±, C 2 ). 

For v, note that since A\ is a sum of commuting orthogonal projectors, its smallest 
non-zero eigenvalue is at least 1 (assuming p > 1). Similarly, one infers from the spectrum 
of Ed stated above that the smallest non-zero eigenvalue of A 2 scales as Q(l/L 2 ). It follows 
that v e Q(l/L 2 ). As for a(Ci, C 2 ), note that the null spaces £1 and C 2 can be written as 

A = ^S +m) ®span(|^) : (i/)\Q • • • 0) = 0) o <g> span(|l), . . . , \L)) D © (3.26) 
^g +m) ®|0---0) c ®span(|^) : (#y) = 0) D , (3.27) 
C 2 = ^g +m) ®span(|^) : (i>\0 ■ ■ ■ 0) = 0) o <g> \j) D - (3.28) 



Observe that C\ fl C 2 = {0}, as required by Lemma 1.8. Then, letting LT^ denote the 
projector onto C\, we analyze 

cos 2 a(£i,£ 2 ) = , max \(x\y)\ 2 = max (y\U Cl \y) = max (y\IL 1 + IL 2 \y), 

unit \x)eCi,\y)eC2 unit \y)eC'2 unit \y)&C2 

(3.29) 

where Hi and H 2 project onto the spaces in Equations (3.26) and (3.27), respectively. 
As (y\n. 2 \y) = 0, we simply need to maximize which is equivalent to maxi- 

mizing |('?/'|7')| 2 for any unit vector in register D and for unnormalized state I7') : = 
( v^E+T ^2i=i I*))- By the Cauchy-Schwarz inequality, this quantity is upper bounded by 
Lj (L + 1). We thus obtain the bound cosa(£i, C 2 ) < yjL/(L + 1). Combining this with 
the identity 2 sin 2 | = 1 — cos x and the Maclaurin series expansion for y/1 + x (where 
< 1) yields 2 sin 2 a ( £ i' £2 ) > ^ ' , Substituting into Lemma 1.8, the desired result 



\x\ 



follows. □ 
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Lemma 3.23. Define n hist := E^{o,i}" l!; e{o,i} ra \ x , y) i x , y\hist as the projector onto S hist , 
let ( := 2(1 + 2 2 ( ra+m ))/(L + 1), and consider T C [n]. Then, if V outputs one with 
probability at least 1 — e for inputs (x, \y)) with x e {0, l} n s«c/i that X{ = 1 /or all i E T 
and for all m-qubit \y), one has 



n 



hist 



H ut — 



n hist =< Cel. (3.30) 



Proo/. Define Z 1 := Il hist (- ^ ieT G *) n hist and Z 2 := n h i S tiJ out n h i S t. Letting z G {0,1}™ 
denote the characteristic vector of T, i.e. the ith bit of z is set to one if and only if 
i e T, it follows that any state |x, y) hist is an eigenvector of Z 1 with eigenvalue — \T\. 
Hence, for example, Tr (Zi\l n , y)(l n , y\ hist ) = 0. Further, since V accepts a non-empty 
monotone set, it must accept input (l n , \y)) with probability at least 1 — e, implying 
Tr(Z 2 |l n , y) (1™ , y| hist ) < -j^py - This yields an upper bound of 

Tr((Z 1 + Z 2 )|l", 2/ )(l",y| hist ) < ^ (3.31) 

in this simple case. We now show that deviating from |l n ,y) hist above cannot increase our 
expected value against Z x + Z 2 by "too much" . 

To do so, let |0) = ai|0i)+a: 2 |0 2 ) be an arbitrary valid history state where |cti 1 2 + 1«2 1 2 = 
1, |0i) is a (normalized) superposition of valid history states where each history state in 
the superposition has a string x in register A at time zero satisfying x^ — 1 if % e T, and 
where \<f> 2 ) is a valid history state in the space orthogonal to space of all possible states 
|0i). We thus first have that 

Tr (^10X01) < + a 2 Tr (Z 1 |0 2 )(0 2 |) < a 2 2 [(\T\ - 1) - |T|] < - \a 2 \ 2 . (3.32) 

Moving on to Z 2 , observe that straightforward expansion yields 

Tr(Z 2 |0)(0|) = I^^TrC^l^iX^I) + |« 2 1 2 Tr(^ 2 10 2 > (0 2 1 ) (3.33) 
+ a^Tr^l^^D + at^Tr^l^)^!). (3.34) 

To upper bound this quantity, we use the fact that (a\b) + (b\a) < (a\a) + (b\b) for complex 
vectors \a) and \b). Namely, setting \a) := aiV^l^i) and \b) := a 2 \^Z 2 \(p 2 ) yields 

Tr(Z 2 |0)(0|) < 2|a 1 | 2 Tr(Z 2 |0 1 )(0 1 |) + 2|a 2 | 2 Tr(Z 2 |0 2 )(0 2 |) (3.35) 
< 2|a 1 | 2 Tr(Z 2 |0 1 )(0 1 |) + 2|a 2 | 2 -^-, (3.36) 

Li + 1 
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where the second inequality follows since || Z 2 \ oa < 1/(L + 1). Finally, in order to up- 
per bound the term Tr(Z 2 |0i)(0i|) in Equation (3.36), observe that since by assumption 
Tr(Z 2 \x,y){x,y\ hist ) < for all x with Xi = 1 for i G T, and since H out is a projector, it 
follows that the norm of H out \x,y) Mst is at most ^e/(L + 1). Using the Cauchy-Schwarz 
inequality, this implies that each cross term in the expansion of Tr(Z 2 |0i)(0i|) can con- 
tribute a value of magnitude at most e/(L + 1). Since there are at most 2 2 ( n+m ) such cross 
terms, and since the non-cross terms are weighted by a convex combination, we hence have 
the upper bound of Tr(Z 2 |0i)(0i|) < (1 + 2 2 ( n+m ))e/(L + 1). Combining these bounds, we 
have 

Tr((Zi + Z 2 )|0)(0|) < -\a 2 \ H — — h y— (3.37) 



L+1 L+1 

2 |«i| 2 (1 + 2 2 (™ +m ))e + \a 2 \ 2 (1 — L) 



(3.38) 



L + 1 

2(1 + 2 2 < n+m )) 

< 1 L + 1 } e (3.39) 

= Ce (3-40) 

where the second inequality holds when L > 1. □ 



Finally, we show that QIRR is cq-£ 2 -hard to approximate. 

Theorem 3.24. There exists a polynomial time reduction which, given an instance of an 
arbitrary cq-E 2 problem IT, outputs an instance of QIRR with threshold parameters h and 
h' satisfying h'/h e Q(N e ) for some e > 0, where N is the encoding size of the QIRR 
instance. 



Proof. We begin by applying Theorems 3.20 and 3.21 to reduce the instance of II to an 
instance (S = {Gj}™^ 2 ,a, f3, g, g') of QSSC, and henceforth assume the terminology and 
definitions introduced in Theorem 3.21. Recall that any cover in this QSSC instance must 
include the terms G n+ \ and G n+2 . For ease of exposition, we first reduce this instance 
to QIRR with parameters h = g + 2r — 3 and h' = g' + 2r — 3, where recall r is the 
number of terms in H = Yll=i Hi- This, however, does not suffice to obtain a hardness 
of approximation gap, as tracing through Theorems 3.20 and 3.21 yields r 6 u(g),u(g'), 
implying h'/h — y 1 as the instance II in Theorem 3.20 grows in size. We then slightly 
modify our reduction to improve the threshold parameters to h = gr — 1 and h' = g'r — 1, 
which yield the desired hardness of approximation gap. 
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We now state our instance (T, 7, S, h, h') of QIRR, and follow with an intuitive expla- 
nation. For simplicity of exposition, we assume r is a power of two, but our construction 
can be easily modified to handle the complementary case. We also label H r = H out . We 
now introduce three registers: a "tag" qubit register (denoted A), the space the original 
cover S acts on (denoted B), and logr "chaperone" qubits (denoted C). The Hamiltonian 
terms we define for QIRR, T := {F i }£!' 1 2r ~ 1 , act on A® B® C = B® B^ n+m+p+q ^ <g> i3® logr , 
and are defined as: 

Fi ■= |0)(0| A (g)(G' 1 ) B ®7 c (3.41) 



F n := |0>(0U<8> (G n ) B ®I c (3.42) 
F n+1 := (A + 1)[\0)(0\ A ®(H 1 ) B ®I C + \1)(1\ A ®I B ®\0)(0\ C ] (3.43) 



ci 



F n+r _! := (A + l)[\0)(0\ A ®(H r _ 1 ) B ®I c + \l)(l\ A ®I B ®\r-2)(r-2 
F n+r : = \0){0\ A ®(I-H 1 ) B ®I c + \l){l\ A ®I B ®\r-l)(r-l\ c (3.44) 



F n+2r -i ■= \0){0\ A ®(I-H r ) B ®I c + \l){l\ A ®I B ®\r-l){r-l\ c . (3.45) 

We set 7 := a+r-1, 5 := fi+r-1, h := £(+2r-3, and h! := ^'+2r-3. Note that each Fj is 
a projection up to scalar multiplication, as required. We now provide the intuition behind 
the construction. QIRR is stated in terms of projectors Fj (up to scalar multiplication), 
whereas QSSC is stated in terms of Hermitian operators Gi. Hence, in order to move 
from the latter to the former, a natural idea is to treat each local Hamiltonian term in the 
sums comprising G n+ i and G n+2 as distinct terms F n+1 , . . . , F n+r _i and F n+r , . . . , F n+2r -i, 
respectively. The problem with this approach is that in order to rigorously argue that the 
gap between thresholds g and g' for QSSC is preserved when defining thresholds h and 
h! for QIRR, we would like, for example, that all terms Fj making up G n+ \ are chosen 
together in any candidate cover T' C T. To address this issue, we introduce the chaperone 
qubits, which ensure that any candidate T' plays by these rules. In particular, we can 
make sure that all terms F n+1 , . . . , F n+2r -i are chosen in any T' , allowing us to rigorously 
apply our knowledge of the spectra of G n+ i and G n+2 to the analysis of Ft versus Ft>- 

We now show that if there exists a cover S' C S for QSSC of size v, then there exists a 
V C T such that \T'\ = v + 2r — 3 satisfying the conditions for a YES instance of QIRR. 
Namely, let 

T ' = {Fi} ie[n] and Gi€S > u {F n +i, • • • , F n+2 r-i}- (3.46) 
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Note that it suffices to show that Ft> 7/ (since if Ft> t: jl, then Ft b J I as well). To 
show this, observe first that we can write Ft> — K\ + K2, for K\ and K2 defined as: 



r-1 



Ki := |0)<0| x ® G i + (A + l)Y,H i + ^2(1- Hi) \ ®J C (3.47) 

\ie[n] and Gi£S' »=1 i=l / B 

= |0)(0| A ®(G S / + (r-l)/) B ®/ o (3.48) 
K 2 := |1)<1| A ®I B <8> ^(A + l) ^|*)<i|J +r|r-l)(r-l|J (3.49) 



r-2 



|1)(1| a ®/b® r/ + (A + l-r)^)|i)(i| , (3.50) 



where we can assume without loss of generality that A > r — 1. Let \<fi) = a \0) A \<fio) bc + 

A |0i) bc* De an arbitrary state acti 
unit vectors |0o)bc anc ^ I^i)bc- Then 



ai|l) A |0i) bc De an arbitrary state acting on this space with |ao| 2 + |ai| 2 = 1 and for some 



Tr(F T ,|0>(0|) = Tr{K x \(j>){<l>\)+Tr{K 2 \<j>){<i>\) (3.51) 

= Kl'Tr^lOXOl <g> \<h)(<h\) + l«i| 2 Tr(K 2 |l)(l| ® |0i)(0i|) (3.52) 

> \a Q \ 2 (a + r — 1) + \ai\ 2 r (3.53) 

> 7, (3.54) 

where the first inequality follows since Tt(XabIa ® Vb) = Tr(Tr A (X AB )Ye) and since G5/ 
is a cover by assumption, and the second inequality since < a < 1. We conclude that 
if T / ^ 7 J, as desired. 

We now prove the other direction, namely that if there does not exist a cover S' C S 
for QSSC of size v , then all subsets T' C T of size |T'| = t> + 2r — 3 satisfy the conditions 
for a NO instance of QIRR. To see this, note first that any candidate T' must include the 
terms Fi for n + l<i<n + r — 1. This is because if, for example, F n+X ^ T', then vector 
\<p) := |1) A |^) S |0) C obtains expected value A + 1 > 7 against F T , but \<p) is orthogonal to 
Ft'. A similar argument holds for the terms Fi with indices n + r<i<n + 2r — 1, since 
state |0) := |l) A |V , )sl r — l)c obtains expected value r > 7 against Ft, but obtains value 
at most r — 1 < 5 against Ft> if there exists an % G [n + r, n + 2r — 1] such that i (jL T' ' . 
Thus, for any candidate T' of size v + 2r — 3, this leaves v — 2 terms to be chosen from 
{Fi, . . . , F n }. If we now restrict ourselves to states of the form |O) A | , 0) BC ,, we find that we 
are reduced to the same argument in the NO direction of Theorem 3.21 - namely, as S is 
a cover and any S' C S of size t> is not a cover, there must exist a state |0) := \0) A \ip) BC 
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such that 

Tr(\<f>)(<f>\F T ) = Tr [Tr c (\^)(i>\)(G s + (r - 1)1)] > a + (r - 1) > 7, (3.55) 

whereas 

Tr(|0)(0|F T ,) = Tr [Tr c (|^|)(GV + (r - 1)/)] < /3 + (r - 1) = 5. (3.56) 

This concludes the reduction from QSSC to QIRR with parameters h = g + 2r — 3 and 
ti = g' + 2r - 3. 

To obtain improved parameters h = gr — 1 and h' = g'r — 1, we modify the construction 
above as follows (intuition to follow): The terms Fi for n + 1 < n + 2r — 1 from the old 
construction remain unchanged. For i G [n], we replace each F« := |0)(0| A <8> (Gj)s Cg> Ic 
with the r distinct terms: 

Fi,i :=|0)(0U®(G 4 )b®|0)(0| c , (3.57) 
F, 2 :=|0)(0| A «)(G i ) B ®|l)(l| c , (3.58) 

Fi,r :=|0)<0| x ®(GOfl®|r-l)(r-l| c . (3.59) 

Thus, the total number of terms in our QIRR instance increases from n+2r— 1 to r(n+2) — 1. 
Intuitively, we have used the chaperone qubits to split each Fi into r terms Fi j, such that 
if in the old construction we chose Fi G T', then in the new construction we must place 
all r terms F it j in T' in order for the new F?> to maintain its desired spectrum. Thus, 
whereas the old construction chose g — 2 terms Fi to place in T", the new construction 
chooses r(g — 2) terms Fij to place in T", yielding the desired thresholds h = gr — 1 and 
ft' = 3V - 1. 

The completeness and soundness proofs now follow similarly to the previous case. 
Namely, given a cover 5" C S for QSSC of size v, the set T' C T with |X"| = — 1 
we choose is 

T ' = { F i,j}ie[n} and G i6 5' J6[r] U • • • » ^n+2r-l}- (3.60) 

Since Ft> in this new reduction is precisely Ft> in the old reduction, the remainder of this 
direction proceeds identically. Conversely, if there does not exist a cover S' C S for QSSC 
of size v , we similarly first argue that Fi for ri + l<i<n + 2r — 1 must be chosen in 
any candidate T 1 C T of size |T'| = tr — 1, leaving r(v — 2) terms to be chosen from 
{Fi t i, . . . , F n/r }. This implies that for any such T', there must exist a j G [r] such that the 
number of terms F^j in T' is at most v — 2. Since no cover of size v exists for our QSSC 
instance, we conclude there exists an appropriate choice of \<f>) := \0) A \?P) B \j) c such that 
Equations (3.55) and (3.56) still hold. □ 
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3.4 Improvements to hardness gaps 



We now improve the hardness gaps of Theorems 3.20, 3.21, and 3.24 to obtain the results 
claimed in Theorems 3.5 and 3.7. The key idea is to use the fact that the gap for QMW 
from Theorem 3.20 can be amplified by composing the cQMA circuit W with itself. The 
results here adapt Section 5 of [235] in a simple manner to the quantum setting. 

Specifically, assume for the moment that the output qubit of W is actually a classical 
bit, i.e. that the output qubit is given after being measured in the computational basis. 
Then, one can recursively define W l := W and W l as W l ~ x with n independent copies 
of W at each of its n INPUT bits. (Note that entanglement between quantum proofs for 
different copies of W does not affect the soundness of W*, as each W outputs a classical 
bit, and no quantum proofs are reused.) Now, such a recursive composition of W can 
easily be made well-defined even if H^'s output qubit is a superposition of |0) and |1) using 
the principle of deferred measurement [200] - namely, without loss of generality, we can 
assume W first copies its n classical INPUT bits to an ancilla, and henceforth acts only 
on its CHOICE and ancilla registers. Thus, the output qubit of each copy of W in W t 
is effectively used only as a classical control in the remainder of the circuit, and so the 
measurement of all output qubits can be deferred to the end of W l . Finally, since we can 
assume using standard error reduction that the completeness and soundness error of W 
scale as 2 _n , it follows by the union bound that with probability exponentially close to 1, 
all the W circuits comprising W l output the correct answer. In other words, with high 
probability, one can think of W l as a composition of zero-error circuits W (where zero-error 
means zero completeness and soundness error). With this viewpoint, the proof of Lemma 
3 of Reference [235] directly yields the following result in the quantum setting. 

Lemma 3.25. If W is a cQMA circuit accepting exactly a monotone set, it follows that: 

1. \W l \ < n* \W\, where \W\ denotes the size of W , 

2. W accepts an input of Hamming weight k if and only if W l accepts an input of weight 
k\ 

3. W l accepts exactly a monotone set. 

To improve the hardness gap of Theorem 3.20, we now simply replace the cQMA circuit 
W constructed in the proof of Theorem 3.20 with W l for an appropriate choice of t. The 
details and resulting analysis follow identically to the proof of Theorem 4 of Reference [235] , 
which combined with the improved disperser construction of Reference [232] (see Theorem 
7.2 therein) yields: 
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Theorem 3.26. QMW is c<\-Y>2-hard to approximate with gap N 1 6 for any e > 0, for N 

the encoding size of the QMW instance. 

Using this as the starting point in our reduction chain to QSSC and QIRR, a closer 
analysis of the proofs of Theorems 3.5 and 3.7 now yields: 

Corollary 3.27. QSSC and QIRR are cq-Y>2-hard to approximate with gaps iV 1_e and 
Nz~ t for any e > 0, respectively, and where N is the encoding size of the respective QSSC 
and QIRR instances. 

3.5 Hardness of approximation for QCMA 

We now briefly remark that the approach of Theorems 3.20 and 3.26 can be adapted 
to show hardness of approximation for QCMA. Our result is a straightforward extension 
of Umans' classical result [235] showing NP-hardness of approximation for the problem 
MONOTONE MINIMUM SATISFYING ASSIGNMENT. 

Specifically, define the problem QUANTUM MONOTONE MINIMUM SATISFYING 
ASSIGNMENT (QMSA) analogously to QMW, except with the definition of a cQMA 
circuit V modified to drop the second (quantum) proof, i.e. V now only takes one input 
register comprised of n classical bits. (For example, Definition 3.12 is modified to say that 
V accepts x £ {0, l} ra in INPUT if measuring \a) in the computational basis yields 1 with 
probability at least 2/3.) Then, it is straightforward to re-run the proofs of Theorems 3.20 
and 3.26 without the existence of a second quantum proof register, leading to Theorem 3.8. 

3.6 A canonical cq-E2-complete problem 

In this section, we first show that a quantum generalization of the canonical Sg-complete 
problem S 2 SAT, denoted cq-S 2 LH, is cq-S 2 -complete. We then observe that a similar proof 
yields cq-S2-hardness of approximation for an appropriately defined variant of cq-X^LH. 

Definition 3.28 (cq-£ 2 LH). Given a 3-local Hamiltonian H acting on N = n + m qubits, 
and a, b £ K such that a < b for b — a > 1, output: 

• YES if '3 x £ {0,l} n such thatV \y) £ B® m , Tr(H\x){x\ ® \y)(y\) > b. 

• NO i/V x £ {0, l} n , 3 \y) £ B® m such that Tr(H\x){x\ ® \y)(y\) < a. 
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Theorem 3.29. cq-X 2 LH is cq-£ 2 - complete. 

Proof. That cq-£ 2 LH G cq-£ 2 follows from Kitaev's verifier for placing fc-local Hamiltonian 
in QMA [171] (see Section 1.5.5). As for cq-X 2 -hardness, for simplicity we show the result 
for the case of cq-£ 2 LH defined with 5-local Hamiltonians. The proof for the 3-local case 
follows identically by instead substituting the 3-local circuit-to-Hamiltonian construction 
of Reference [164] below (this is possible because our proof does not exploit the structure 
of the clock register or H stah ). 

To see that any instance II of a problem in cq-S 2 reduces to an instance of cq-S 2 LH, 
let V" denote the cq-S 2 verification circuit for II. Recall that V" acts on a classical proof 
register A, a quantum proof register B, and an ancilla register C. We begin by modifying 
V" to obtain a new equivalent circuit V which first copies the (classical) contents of A 
to its ancilla register C, and henceforth acts on this copied proof in C throughout the 
verification. This ensures the contents of A remain unchanged during the verification. 
Next, we modify V to obtain V by concatenating to its end a Pauli X on the output 
qubit; this swaps the cases in which V accepts and rejects, respectively. This is necessary 
because if |c) <g> \q) is accepted by V, then \c,q) hist obtains low energy against Kitaev's 
Hamiltonian, whereas in our YES instance here we require high energy. Finally, we apply 
Kitaev's circuit-to-Hamiltonian construction from Section 3.2 on V to obtain a 5-local 
Hamiltonian H. 

Suppose now that we have a YES instance of n, i.e. there exists bit string |c) such that 
for all quantum states \q), the circuit V" accepts proof |c) <8> \q) with probability at least 
1 — e (and hence V rejects |c) £g> \q) with probability at least 1 — e). We show that for all 
\if)) B CD , the state \c) A ®\ip) B c D attains expectation value at least b against H, for b from 
Lemma 3.19. In other words, letting n c := (|c)(c| A <g> Ib,c,d), we claim 

(c| ® (V>|#|c) ® \ip) = (c| g> {i)\Yi c HYi c \c) ® |^> > b. (3.61) 

To see this, observe first that 

n c ifstabn c 
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c){c\ A ®I B ® \J2\l)(l\ Ci j ®|0}(0| D =: \c)(c\ A ®H! Q , (3.62) 

c) (c\ A <g> \0)(0\ Bi ® I c ® \L) (L\ D =: \c) (c\ A ® H' ont , (3.63) 

L-l 

c)(cU®/b,c®^|01)(01| AjA+i =: \c)(c\ A ®H' stah . (3.64) 



As for n c if prop n c , recall that the verification circuit V consists of two phases: The copy 
phase, consisting of n CNOT gates copying the contents of A to C, and the verification 
phase, consisting of the remaining L — n gates of V . In other words, we can write 



prop 



3=1 j=n+l 



(3.65) 



where Y^=i Hj corresponds to the copy phase and Y^= n +i ^° ^ ne verification phase. 
Since during the verification phase, V does not act on A, we have for all j > n that 



n c #,n c 



\c)(c\ A ® 



{V 3 ) B ,c®\j)(3-l 



D -\(Vl) B ,c®\3-l){3\ D + (3.66) 

Jb,c®(\j)(j\ + \3-1)(3-1\)d (3.67) 
=: \c){c\ A ®H' j . (3.68) 
As for the copy phase, let \i){i\ <8> I act on B <8> B for % e {0, 1}. Then, observe that 

<g> /) CNOT(|i)(i| <g> I) = <g> X\ (3.69) 

where X is the Pauli X operator and X 1 — X if % — 1 and X 1 = / otherwise. This implies 
that for any step j < n, i.e. where V applies a CNOT gate with qubit Aj as control and 
Cj as target, and letting Cj denote the jth bit of c, we have 



U c HjU c 



\c)(c\ A ® 



® |J')(J - 1| D - ^g- ® |J " 1)0'Id+ 



/®(U)0'| + b'-i)0'-i|)D 



=: |c)(c| A <g)#'( C ), 



(3.70) 

(3.71) 
(3.72) 



where the notation Hj(c) means H'j is a function of c. Letting i? prop (c) := ~Y^i=\H'j + 
ELi ^( c ) and H ( c ) ■= H L + Kut + ^stab + H'(c), we thus have that 



\H\c) 



\H(c)\1>). 



(3.73) 



It thus suffices to show that H(c) >z bl. 

To see this, we return to the circuit V, and think of V not as accepting classical input 
c, but rather as corresponding to a set of circuits {V c }, where each V c is just V with c 
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hard-wired into register A. In particular, at time step < j < n, V c applies X Cj to qubit 
Cj. Taking this interpretation, we observe that for any string c, plugging V c into Kitaev's 
circuit-to-Hamiltonian yields precisely the Hamiltonian H(c). Thus, since by assumption 
for our particular choice of c, V" accepts |c) <g) \q) for all quantum proofs \q), it follows that 
V c rejects all \q) with probability at least 1 — e. Hence, Lemma 3.19 implies H(c) >z bl, as 
desired. 

The converse direction proceeds similarly. Namely, suppose we have a NO instance of 
II, i.e. for all bit strings |c), there exists a quantum proof \q) such that V" rejects |c) <S> \q) 
with probability at least 1 — e. Then, we wish to show that for all c, there exists a \ip c ) 
such that (ip c \H(c)\if} c ) < a, for a from Lemma 3.19. To show this, fix an arbitrary c. Since 
there exists a |g) such that V c accepts \q) with probability at least 1 — e, it follows that the 
history state \ip c ) := J2i=o Vi" ' ^Al) b ® |0 • • • 0) c <g) \i) D indeed satisfies 

(i) c \H{c)\i) c ) = (i> c \H! Q + H' out + H' Btah + H' prop (c)\^ c ) < + a + + = a. (3.74) 

□ 

Note that the proof of Theorem 3.9 has a special property — the string c fed into the 
classical proof register of V" is mapped directly in our reduction to the candidate ground 
states \c)\q) for 3-local Hamiltonian H. This means, for example, that if there exists a c 
with the desired properties for a YES instance of our starting cq-S 2 problem, then setting 
x = c in Definition 3.28 yields that the cq-S 2 LH instance we have mapped to is also a 
YES instance. It follows that applying the reduction in the proof of Theorem 3.9 to our 
hard-to-approximate instance of QMW from Theorem 3.26 directly yields Theorem 3.10, 
i.e. that the following variant of cq-S2LH, which we call cq-I^LH-HW, is cq-S2-hard to 
approximate. Intuitively, cq-S 2 LH-HW is defined analogously to cq-S 2 LH, except that 
here the goal is to minimize the Hamming weight of x. 

Definition 3.30 (cq-X^LH-HW). Given a 3-local Hamiltonian H acting on N = n + m 
qubits, a, b G M. such that a < b for b — a > 1, and integer thresholds < g < g' , output: 

• YES if there exists x G {0, l} n of Hamming weight at most g such that for all \y) G 
B® m , Tr(H\x){x\ <g> \y)(y\) > b. 

• NO if for all x G {0, l} n of Hamming weight at most g' , there exists \y) G B® m such 
that Tv(H\x){x\ <g> \y)(y\) < a. 
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Chapter 4 



QMA variants with polynomially 
many provers 

This chapter is based on [111]: 

S. Gharibian, J. Sikora, and S. Upadhyay. QMA variants with polynomially many provers. 
Available at arXiv.org e-Print quant-ph/1108.0617vl, 2011. 

In this chapter, we study three variants of multi-prover quantum Merlin-Arthur proof 
systems. We first show that the class of problems that can be efficiently verified using 
polynomially many quantum proofs, each of logarithmic-size, is exactly MQA (also known 
as QCMA), the class of problems which can be efficiently verified via a classical proof 
and a quantum verifier. We then study the class BellQMA(poly), characterized by a veri- 
fier who first applies unentangled, nonadaptive measurements to each of the polynomially 
many proofs, followed by an arbitrary but efficient quantum verification circuit on the 
resulting measurement outcomes. We show that if the number of outcomes per nonadap- 
tive measurement is a polynomially-bounded function, then the expressive power of the 
proof system is exactly QMA. Finally, we study a class equivalent to QMA(m), denoted 
SepQMA(m), where the verifier's measurement operator corresponding to outcome accept 
is a fully separable operator across the m quantum proofs. Using cone programming du- 
ality, we give an alternate proof of a result of Harrow and Montanaro [128] that shows a 
perfect parallel repetition theorem for SepQMA(m) for any m. 
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4.1 Introduction and results 



The study of classical proof systems has yielded some of the greatest achievements in the- 
oretical computer science, from the Cook-Levin theorem [72, 179], which formally ushered 
in the age of NP verification systems and the now ubiquitous notion of NP-hardness, to 
the more modern PCP theorem [30, 29], which led to significant advancements in our un- 
derstanding of hardness of approximation. A natural generalization of the class NP to 
the quantum setting is the class quantum Merlin- Arthur (QMA) [171], where a computa- 
tionally powerful but untrustworthy prover, Merlin, sends a quantum proof to convince an 
efficient quantum verifier, Arthur, that a given input string x G {0, 1}™ is a YES-instance 
for a specified promise problem. (See Definition 1.5 for a formal definition of QMA.) It is 
easy to see that QMA proof systems are at least as powerful as NP, since the ability to 
process and exchange quantum information does not prevent Arthur from choosing to act 
classically. 

As discussed in Sections 1.5.2 and 1.5.3, much attention has been devoted to QMA over 
recent years. We now have a number of problems which are complete for QMA, with the 
quantum analogue of classical constraint satisfaction, the physically well-motivated fc-local 
Hamiltonian problem [171], being the canonical QMA-complete problem. Further, QMA is 
an extremely robust complexity class that satisfies strong error-reduction properties [191]. 
However, there still remain important open questions. One natural such question, which 
is the focus of this chapter, is: How does allowing multiple unentangled provers affect the 
expressive power of QMA ? 

Specifically, unlike in the classical setting where allowing multiple proofs, each quanti- 
fied by a distinct existential quantifier, is trivially equivalent to a single existentially quan- 
tified proof, whether the same logic holds in the quantum setting is a highly non-trivial 
open question due to the quantum phenomenon known as entanglement (see Section 1.6.1). 
Intuitively, entanglement between multiple proofs can be used by cheating provers to corre- 
late their proofs in a way stronger than possible classically. To this end, in this chapter, we 
are interested in studying the class QMA(poly) [ ], a.k.a. quantum Merlin- Arthur proof 
systems with polynomially many Merlins, where the verifier receives a polynomial number 
of quantum proofs which are promised to be unentangled with each other. Despite much 
effort, little is known (more details under Previous Work below) about the structural prop- 
erties of QMA(poly), except for the obvious containments QMA C QMA(poly) C NEXP. 

Our Results: We show the following three results regarding variants of QMA(poly). 
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A complete characterization in the logarithmic-size message setting. Let the 

class QMA log (poly) denote the restriction of the class QMA(poly) to the setting where 
each prover's proof is at most a logarithmic number of quantum bits, or qubits. We show: 

Theorem 4.1. QMA log (poly) = MQA. 

Here, recall from Chapter 1 that MQA, also known as QCMA, is defined as QMA except 
Merlin's proof is a polynomial-size classical string. Theorem 4.1 says that if each prover is 
restricted to sending short quantum proofs, then one can not only do away with multiple 
provers, but also of the need for quantum proofs altogether. 

Towards a non-trivial upper bound on BellQMA(poly). One possible approach 

to the question of QMA = QMA(poly) is to study BellQMA(poly) [52, 10, 64]. Bel- 
lQMA(poly) is defined analogously to QMA(poly), except that before applying his verifi- 
cation circuit to the polynomially many unentangled quantum proofs, Arthur must measure 
each proof using a nonadaptive and unentangled (across all proofs) measurement (we call 
this Stage 1 of the verification). He then feeds the resulting classical outcomes induced 
by these measurements into an efficient quantum circuit (we call this Stage 2), which 
implements a two-outcome measurement operation corresponding to outcomes accept and 
reject. 

The significance of BellQMA(poly) here is that if QMA ^ BellQMA(poly), then it 
follows that QMA ^ QMA(poly), since QMA C BellQMA(poly) C QMA(poly). To 
this end, Brandao has shown that for constant m, QMA = BellQMA(m) [ ]. Where 
BellQMA(poly) lies, however, remains open. For example, the techniques used to show 
QMA(2) = QMA(poly) [128] do not seem to yield an analogous result BellQMA(2) = 
BellQMA(poly) as they require entangled measurements (i.e. SWAP test measurements) 
across multiple proofs, which violate the definition of BellQMA. 

To make progress on BellQMA(poly), we introduce the class BellQMA [r, m], which is 
defined to be BellQMA(m) with m provers and the additional restriction that in Stage 1 
above, the number of outcomes per proof in Arthur's nonadaptive measurements is upper 
bounded by r. We then show the following: 

Theorem 4.2. For any polynomially bounded functions r, m : N — > N, it holds that 
BellQMA[r, m] C QMA (where the containment holds with equality when r > 2). 

In other words, BellQMA(poly) cannot be used to show that QMA ^ QMA(poly) if the 
verifier in the BellQMA (poly) protocol is restricted to have a polynomially bounded number 
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of measurement outcomes per proof in Stage 1. We remark that, in general, the number 
of such measurement outcomes can be exponential in the input length — the restriction 
that r be a polynomially bounded function is crucial for the proof of Theorem 4.2. For 
this reason, our result complements, rather than subsumes Brandao's result [52]. In other 
words, in our notation, Brandao has shown that BellQMA[exp, const] = QMA, and we 
show BellQMA [poly, poly] = QMA. 

Note that we allow the second stage of the BellQMA(poly) verification procedure above 
to be quantum, as per the definition suggested by Chen and Drucker [64], as opposed to 
classical, as studied by Brandao [52]. The conclusion of Theorem 4.2 holds even if the 
second stage of verification is completely classical. 

Finally, it is worth noting that by combining Theorems 4.1 and 4.2, we conclude that in 
the setting of BellQMA(poly), if MQA ^ QMA, then having the Merlins send logarithmic- 
size proofs without any restriction on the number of local measurement outcomes of Arthur 
in Stage 1 has less expressive power than sending polynomial-size proofs but restricting 
the number of outcomes, even though the number of measurement outcomes in Stage 1 
per Merlin in both cases is the same, i.e. polynomial in the input length. 

Perfect parallel repetition for SepQMA(m). A key question in designing proof sys- 
tems is how to improve the completeness and soundness parameters of a verification pro- 
tocol without increasing the required number of rounds of communication. A natural 
approach for doing so is to repeat the protocol multiple times in parallel. With QMA, 
however, this raises the concern that Merlin might try to cheat by entangling his proofs 
across these parallel runs. If, though, perfect parallel repetition holds, it means that for any 
input string x, if the verification procedure V accepts with probability p(|x|), then if we 
run V k times in parallel, the probability of accepting in all k runs of V is precisely p(|a;|) fc . 
In other words, if perfect parallel repetition holds, there is no incentive for Merlin to cheat 
- an honest proof which is a product state across all k runs achieves the maximum success 
probability. 

Our final contribution is an alternate proof of a perfect parallel repetition theorem for a 
class equivalent to QMA(m), namely SepQMA(m). The theorem was first proved in Har- 
row and Montanaro [128] in connection with an error reduction technique for QMA(poly). 
However, our proof is significantly different from theirs and uses the cone programming 
characterization of QMA(poly). Here, SepQMA(m) is defined as QMA(m) with the restric- 
tion that Arthur's measurement operator corresponding to acceptance is an unentangled, 
or separable, operator across the m unentangled proofs. We show: 

Theorem 4.3. SepQMA(m) admits perfect parallel repetition. 
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Our alternate proof of Theorem 4.3 is significant in that, to the best of our knowledge, it 
is the first use of duality theory for a cone program other than a semidefinite program to 
establish a parallel repetition result (note that cone programming generalizes semidefinite 
programming). We remark that semidefinite programs have been previously used to show 
perfect or strong parallel repetition theorems for various other models of (single or two- 
prover) quantum interactive proof systems [71, 127, 165], and that the alternate proof of 
Theorem 4.3 of Harrow and Montanaro is not based on semidefinite programming. Perfect 
parallel repetition for SepQMA(m) in itself is interesting, as it has been used to show that 
error reduction is possible for QMA(m) proof systems [128]. 

Proof ideas and tools: The proof of our first result, Theorem 4.1, is simple, and is an 
application of the facts that (1) quantum states of a logarithmic number of qubits can be 
described to within inverse exponential precision using a polynomial number of classical 
bits, and conversely that (2) given such a classical description, a logarithmic-size quantum 
state can be efficiently prepared by a quantum circuit. Hence, roughly speaking, one can 
replace a polynomial number of logarithmic-size quantum proofs with a single polynomial 
size classical proof, thereby avoiding the danger of a cheating Merlin using entanglement. 
Although the proof is simple, one cannot hope for a better characterization using other 
techniques because the reverse containment, i.e. MQA C QMA log (poly), also holds using 
similar ideas. 

More technically challenging is our second result, Theorem 4.2. To show the non-trivial 
direction BellQMA[poly, poly] C QMA, we simulate an arbitrary BellQMAfpoly, poly] pro- 
tocol by a QMA protocol using the following observation: Although consolidating m quan- 
tum proofs into a single quantum proof raises the possibility of cheating using entanglement, 
if Arthur is also sent an appropriate classical "consistency-check" string, then a dishonest 
Merlin can be caught with non-negligible probability. Specifically, in our QMA protocol, 
we ask a single Merlin to send the m quantum proofs of the original BellQMA protocol 
(denoted by a single state | ■?/>)), accompanied by a "consistency-check" string p which is 
a classical description of the probability distributions obtained as the output of Stage 1. 
One can think of this as having the QMA verifier delegate Stage 1 of the BellQMA ver- 
ification to Merlin. Arthur then performs a consistency check between and p based 
on the premise that if Merlin is honest, then p should arise from running Stage 1 of the 
original verification on If this check passes, then Arthur runs Stage 2 of the BellQMA 
verification on p. If Merlin tries to cheat, however, we show that the check detects this 
with non- negligible probability, hence achieving the desired containment. Note that the ac- 
curacy of the consistency check crucially uses the fact that there are at most polynomially 
many outcomes to check for each local measurement of Stage 1. 
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Our last result, Theorem 4.3, is shown using duality theory for cone programs. In 
particular, we phrase the maximum acceptance probability of a (possibly cheating) prover 
for the two-fold repetition of a SepQMA(m) verification protocol as a cone program. We 
then demonstrate a feasible solution for its dual yielding an upper bound on the maximum 
acceptance probability. The objective value of this dual solution is precisely the product 
of the optimum values of the two instances of the SepQMA(m) verification protocols. We 
conclude that one of the optimal strategies of the provers is to be faithful in the following 
sense: Each prover elects not to entangle his/her two quantum proofs for the two instances 
of the SepQMA(m) protocol and instead sends a tensor product of optimal proofs for both 
the instances. 

Previous work. The expressive power of multiple Merlins was first studied by Kobayashi, 
Matsumoto and Yamakami [175], who showed that QMA(2) = QMA(poly) if and only if 
the class of QMA(2) protocols with completeness c and soundness s (with at least inverse 
polynomial gap) is exactly equal to QMA(2) protocols with completeness 2/3 and sound- 
ness 1/3. Recently, Harrow and Montanaro [128] demonstrated a product state test, wherein 
given two copies of a pure quantum state on multiple systems, the test distinguishes be- 
tween the cases when the quantum state is a fully product state across all the systems or far 
from any such state. Using this test, they answered a few important questions regarding 
QMA(poly). In particular, they showed that 

QMA(2) = QMA(poly) (4.1) 

and that error reduction is possible for such proof systems. Prior to their result, the 
answers to both the questions were known to be affirmative assuming a weak version of 
the Additivity Conjecture [10]. One of the crucial properties of the product state test is 
that it can be converted into a QMA(2) protocol, where Arthur's measurement operator 
corresponding to outcome accept is a separable operator across the two proofs. Harrow 
and Montanaro established a perfect parallel repetition theorem for such proof systems, a 
crucial step in obtaining exponentially small error probabilities. 

Blier and Tapp initiated the study of logarithmic-size unentangled quantum proofs [48] . 
They showed that two unentangled quantum proofs suffice to show that a 3-coloring of 
an input graph exists, implying that NP has succinct unentangled quantum proofs. A 
drawback of their protocol is that although it has perfect completeness, its soundness is 
only inverse polynomially bounded away from 1. Shortly after, Aaronson, Beigi, Drucker, 
Fefferman and Shor [10] showed that satisfiability of any 3-SAT formula of size n can be 
proven by 0(y/n) unentangled quantum proofs of 0(log n) qubits with perfect completeness 
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and constant soundness (see also [64]). In a subsequent paper [39], Beigi improved directly 
on Blier and Tapp's result [48] by showing that by sacrificing perfect completeness, one 
can show that NP has two logarithmic-size quantum proofs with a better gap between 
completeness and soundness probabilities than in [48] (see also Chiesa and Forbes [65] and 
Le Gall, Nakagawa, and Nishimura [103] for related improvements which do not sacrifice 
perfect completeness). 

Finally, one of the open questions raised in Reference [10] concerns the power of Arthur's 
verification procedure. In particular, the paper introduces two different classes of verifica- 
tion procedures, BellQMA and LOCCQMA verification. Roughly speaking, LOCCQMA 
verification corresponds to Arthur applying a measurement operation that can be imple- 
mented by Local Operations and Classical Communication (LOCC) (with respect to the 
partition induced by the multiple proofs). The authors raised the question of whether 
BellQMA(poly) = QMA or not. Brandao [ ] showed that BellQMA(m) is equal to QMA 
for constant m. In a recent development, Brandao, Christandl and Yard [53] showed that 
LOCCQMA(m) is equal to QMA for constant m. 

Open problems. A natural open question concerning the results presented in this chap- 
ter is the relationship between BellQMA (poly) and QMA. We believe that understanding 
the complexity of BellQMA protocols will shed new light on the bigger question pertaining 
to QMA (2) and QMA. Another avenue of interest is to find further applications of the cone 
programming characterization of multi-prover quantum Merlin- Arthur proof systems. One 
question concerning the parallel repetition result presented in this chapter is to investigate 
whether cone programming duality can be used to analyze the product state test in Refer- 
ence [128]. Finally, it would be interesting to find other classes of QMA(m) protocols that 
admit a perfect parallel repetition theorem. 

Organization of this chapter. We begin in Section 4.2 with background and notation, 
defining relevant complexity classes in Section 4.2.1, and reviewing cone programming 
in Section 4.2.2. Theorems 4.1, 4.2, and 4.3 are proved in Sections 4.3, 4.4, and 4.5, 
respectively. 

4.2 Preliminaries 

In this section, we state useful lemmas, and discuss relevant complexity classes and cone 
programming. Throughout the chapter, we use \x\ to denote the length of string x G 
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{0,1}*. The standard Hilbert-Schmidt inner product of operators A and B is denoted 
(A, B) := Tr(A^B), where A* denotes the adjoint of A. 

First, a useful lemma in this chapter regarding the trace norm (which is a Schatten 
p-norm with p — 1) is the following: 

Lemma 4.4 ([244]). Let {pi . . . ,pk} C D (X) and {a\, . . . , a k } C D (X). Then for any 
Schatten p-norm, 



Pi 



1=1 



i=l 



<£iia 

8=1 



(4.2) 



Next, generalizing Definition 1.122, we say a (possibly unnormalized) operator A G 
Pos (Xi g) • • • <g) ,Y m ) is /u/Zy separable (i.e. unentangled) if it can be written as 

fe 

A = 2P 1 (i)®-..®P ra (i), (4.3) 

j=i 

where -Pj(^) G Pos {Xj), for every j G [m] and z G [fc]. We denote the cone of fully separable 
operators as Sep (Xi, X 2 , . . . , X m ). In the setting of quantum information, one typically also 
has Tr(A) = 1. It will be useful to note that the set of fully separable density operators is 
convex, compact, and has non-empty interior since it contains a ball around the normalized 
identity operator [124, 125, 126]. 



4.2.1 Relevant complexity classes 

We now define the relevant complexity classes specific to this chapter. Recall that a promise 
problem A = (A yes ,A no ) is a partition of the set {0,1}* into three disjoint subsets: the 
set A yes denotes the set of YES-instances of the problem, the set A no denotes the set of 
NO-instances of the problem, and {0, l}*\(Ay CS U A no ) is the set of disallowed strings. 

We begin by formally generalizing the definition of QMA (see Definition 1.5) to the 
setting of m unentangled provers. 

Definition 4.5 (QMA(m)). A promise problem A = (A yes , A QO ) is in QMA(m) if there ex- 
ist polynomials p , q and a polynomial-time uniform family of quantum circuits {Q„}, where 
Q n takes as input a string x G S* with \x\ =: n, quantum proof \y) G Sep (Xi, X 2 , . . . , X m ( n ^ 
where Xi := (C 2 ) (g,p<n - ) for i G [m(n)] ; and q(n) ancilla qubits in state lo)® 9 ^, such that: 
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• (Completeness) If x G A yes , then there exists a proof \y) G Sep (X\, X 2 , . . . , X m (n)) 
such that Q n accepts (x, \y)) with probability at least 2/3. 

• (Soundness) If x G A no , then for all proofs \y) G Sep (Xi, X 2 , . . . , X m (n)) , Qn accepts 
(x, \y)) with probability at most 1/3. 

The class QMA(poly) is defined as QMA(poly) := [J mgpoly QMA(m). 

For clarity, note that \y) G Sep (Xi, X 2 , . . . , X m ( n y) must have the form \y) = \yi) g) • • • <g) 
\Vm{n)) f° r \Vi) ^ Hence, QMA(m) can be thought of as having m[n) unentangled 
provers. Note that like QMA = QMA(l), the constants 2/3 and 1/3 above can be ampli- 
fied to values exponentially close to 1 and 0, respectively, by having the verifier run the 
verification procedure polynomially times in parallel (this requires increasing the number 
of provers, however). Also, we will use the fact that corresponding to any QMA(m) pro- 
tocol is a two-outcome POVM (see Section 1.4.2) consisting of operators {C accept , C re ject}, 
such that for any candidate proof \ip) = \ipi) <g) • • • <g) r0m(n))) the probability of the verifier 
accepting (rejecting) is given by Tr(C accept |^) (Tr(C re ject|'0}('0|))- 

All complexity classes considered in this chapter are variants of QMA(m) and satisfy 
the properties mentioned above in Definition 4.5. The next two classes we define are: 

1. [QMA log (poly)] A subclass of QMA(poly) in which each Merlin's message to Arthur 
is 0(log(|x|)) qubits in length for input string x. 

2. [SepQMA(poly)] A subclass of QMA(poly), wherein Arthur's measurement operator 
Caccept corresponding to outcome accept is a fully separable operator across the proofs. 

For clarity, we next give a more formal definition of the variant of BellQMA we introduce, 
BellQMAfr, m]. 

Definition 4.6 (BellQMA [r, m]). Let r, m : N — > N be two functions. A promise problem 
A = (A yes , A no ) is in class BellQMA[r, m] if there exists a QMA(m) verification protocol 
in which Arthur is restricted to act as follows. 

1. Arthur performs a polynomial-time quantum computation on the input x and generates 
a description of quantum circuits Vi(x), . . . , V m (x), one for each of the m provers. 

2. (Stage 1) Arthur simultaneously measures all m quantum proofs by applying Vi(x) to 
the i-th quantum proof, where the action ofVi(x) can be described by a unitary operator 
followed by measurement in the standard basis. The label of the i-th measurement 
outcome is stored as a classical string yi also identified as an element of [r(\x\)]. 
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3. (Stage 2) Arthur runs an efficient quantum verification circuit on input x and measure- 
ment outcomes (yi, . . . ,y m ) to decide whether to accept or reject. 

Note that the key distinction between BellQMAfr, m] and BellQMA(poly) is that the for- 
mer has the number of measurement outcomes in Stage 1 of the protocol bounded by 
r(|x|), whereas the latter may allow exponentially many possible outcomes. Throughout 
this chapter, we use the notation BellQMA [poly, poly] to denote 



4.2.2 Cone programming 

We now briefly review basic notions in conic optimization (or cone programming), which 
is a generalization of semidefmite optimization. The reader is referred to the text of Boyd 
and Vandenberghe [51] for further details. 

To begin, recall that a set K in an underlying Euclidean space is a cone if x G K 
implies that Xx G K for all A > 0. A cone K is convex if x, y G K implies that x + y G K . 
Cone programs are concerned with optimizing a linear function over the intersection of a 
convex cone and an affine space. It generalizes several well-studied models of optimization 
including semidefmite programming (where K = Pos(A')) and linear programming (where 
K = R"). In this chapter, we are primarily concerned with the cone of fully separable 
operators K = Sep (X\, X%, . . . , X m ), which as stated in Section 4.2 is a closed, convex cone 
with non-empty interior. 

A cone program associates the following 4-tuple (C, b, A, K) to an optimization problem, 
which we denote as the Primal problem: 



where A : Span(i^) — > IR m is a linear transformation, and K lies in a real Euclidean space. 
(Note that the choice of inner product in (X, C) depends on the Euclidean space K lies 
in.) We say that the cone program is feasible if {X : A(X) = b} fl K is non-empty and 




(4.4) 



Primal problem (P) 



supremum: (X, C) 
subject to: A(X) = b, 
X G K, 



(4.5) 
(4.6) 
(4.7) 
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strictly feasible if {X : A(X) = b} fl mt(K) is non-empty, where int(-) denotes the interior 
of a set. 



Next, associated with a cone K is its dual cone K*, defined as 



K* = {S : (X, S)>0 for all X G K} . 



(4.8) 



Via the dual cone, for every Primal problem, one can define an associated Dual problem 
as follows: 



where A* is the adjoint of A. We remark that so long as K is closed (which is the case for 
the cone of fully separable operators), the roles of the Primal and Dual problems can be 
freely interchanged, since a convex cone K is closed if and only if K = K**. 

The problems (P) and (D) obey the following special relationship. 
Lemma 4.7 (Weak Duality). If X is primal feasible and {y,S) is dual feasible then 



In other words, let the optimal values of (P) and (D) be denoted p* and d*, respectively. 
Then p* < d*. This raises the important question: Does p* = d*l In general, this is not 
the case. However, if indeed p* = d*, we say that strong duality holds. Below we give a 
condition which, if satisfied, guarantees that strong duality holds. 

Theorem 4.8 (Strong Duality). // (P) is strictly feasible, then strong duality holds, i.e. 
p* = d* . In particular, this implies that if p* is finite, then both (P) and (D) attain their 
optimal values, which coincide. 

Note that when K is a closed, convex cone, one can flip the roles of primal and dual 
problems in Theorem 4.8. 



Dual problem (D) 



infimum: (b, y) 
subject to: A*(y) = C + S, 
S G K*, 



(4.9) 
(4.10) 
(4.11) 



(b,y)-(X,C) = (X,S)>0. 



(4.12) 
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4.3 Equivalence of MQA and QMA log (poly) 

We now prove Theorem 4.1, i.e. that MQA = QMA log (poly). We first show the direction 
MQA C QMA log (poly). Let A = (A yes ,A no ) be a promise problem in MQA and let 
x G {0, 1}™ be the input string. Suppose the MQA prover sends an m-bit classical proof to 
the verifier, for polynomially bounded m. Then the following straightforward QMA log (m) 
protocol achieves the desired containment: 

1. Embed classical bits into qubits. Each (unentangled) prover i G [m] sends a 
single qubit \ipj) G C 2 to Arthur. If the z-th prover is honest, his/her qubit is the 
computational basis state corresponding to the z-th bit of the classical MQA proof. 

2. Make things classical again. Arthur measures all proofs in the computational basis, 
obtaining a classical string y G {0, l} m . 

3. Run MQA verification. Arthur runs the MQA verification circuit on x and y and 
accepts if and only if acceptance occurs in the MQA verification. 

The completeness property follows straightforwardly. The soundness property is also easy 
to observe. Note that Arthur runs the MQA verification on a classical string y and hence 
he accepts the string with probability at most 1/3. 

To show the reverse containment, let A = (A ye8 , A no ) be a promise problem in class 
QMA log (poly) and let x G {0, l} n be the input string. Suppose we have a QMA log (m) 
protocol for polynomially bounded m, where prover i sends a |~c log n] -qubit state for 
some constant c > 0. Let r{n) = 2r clogn l = 0(n c ). The MQA protocol proceeds as follows: 

1. Describe proofs classically. The prover sends m classical registers represented by the 
tuple (Ci, C 2 , . . . , C m ), each of length 2n-r{n) to Arthur. If the prover is honest, register 
Q contains a classical description of the z-th quantum proof of the QMA log (m) 
protocol. 

2. State preparation. Using the contents of register Q, for every choice of i G [m], 
Arthur prepares the state by first determining a unitary Ui such that Ui\0 . . . 0) = 
l^i), and then implementing Ui with high precision using a finite set of approximately 
universal gates, obtaining states |^). 

3. Run QMA log (m) verification. Arthur runs the QMA log (m) verification circuit on 

<S> ■ ■ -® \ip' m ) and accepts if and only if acceptance occurs in QMAj (m) verification. 
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Observe that each classical register Q is of size polynomial in n, implying the overall 
proof length is of polynomial size. In Step 1, the prover uses n bits to represent the real 
and imaginary parts of each of the polynomially many entities (r(n) entries) required to 
describe each Let the unit vector described by register Q be denoted \ipi). In Step 2, 
Ui is easily found, as the unitary that maps |0 . . . 0) to \ipi) is the inverse of the unitary that 
maps to |0 . . . 0). Next, Ui can be efficiently decomposed into a product of U[ one- and 
two-qubit unitary gates (see Bernstein and Vazirani [46] for details, or Section 1.5.1 under 
"Universal gate sets" for a brief discussion) such that || Ui — U[ \ OQ is inverse exponentially 
small. Since Steps 1 and 2 can be performed to within inverse exponential error, we 
thus can ensure || — || < e for all i G [m] and for inverse exponential e > 0. By 
Lemma 4.4, it follows that the overall precision error is at most me for polynomial m, and 
thus the completeness and soundness of the protocol are bounded from below and above 
by | — me and | + me, respectively. 

4.4 Equivalence of BellQMA[poly, poly] and QMA 

We now show Theorem 4.2, i.e. that BellQMAfr, m] = QMA for polynomially-bounded 
functions r and m. For notational convenience, let Hj(i) denote Arthur's i-th POVM ele- 
ment in Stage 1 of the BellQMA verification protocol for the j-th prover (i.e. Y7i=i IT,- (2) = 
/), where we assume without loss of generality that the number of possible outcomes is 
exactly r for each prover, and where j G [m] for m the number of provers. 

We proceed as follows. Let A = (A yes ,A QO ) be a promise problem, and x be an input 
string of length n := \x\. Note first that the containment QMA C BellQMA [poly, poly] 
follows since, by definition, QMA C BellQMA[2, 1]. For the reverse containment, suppose 
we have a BellQMA [r, m] protocol for polynomially bounded functions r, m : N — > N with 
completeness 2/3 and soundness 1/3. We show that this protocol can be simulated by a 
QMA protocol as follows. 

Merlin's proof consists of two registers (X, Y), which should be thought of as the clas- 
sical and quantum registers, respectively. Suppose optimal proofs for the BellQMA [r, m] 
protocol for input x are given by pj for j G [m]. Then, in the quantum register Y, an honest 
Merlin should send many copies of the state pj. Specifically, Y is partitioned into m registers 
Yj, one for each original prover, and each Yj should contain k copies of pj, for k a carefully 
chosen polynomial. In other words, Y should contain the state [pf k ]y 1 ® ■ ■ ■ £g> [Pm fc ]v m - We 
further view each Yj as a block of registers (Yj, . . . , Y^) where Yj should contain the Z-th 
copy of pj. 
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In the classical register X, Merlin sends the classical "consistency check" string al- 
luded to in Section 4.1. Specifically, an honest Merlin prepares a quantum state in the 
computational basis, which intuitively corresponds to a bit string describing the m clas- 
sical probability distributions Arthur induces upon applying the measurement operation 
corresponding to Stage 1 of the BellQMA verification to each of the optimal proofs pj, 
respectively. More formally, we partition X into mr registers X* corresponding to each of 
the j G [m] provers and i G [r] POVM outcomes per prover. The content of X*- should 
be Pj(i) '■= (Hj(i),Pj), truncated to a bits of precision (a polynomially bounded), such 
that YJi=\Pj{^) = 1- For example, if the j-th prover's proof was the single qubit state 
Pj = |0)(0|, with IL;(1) = |0)(0| and 11,(2) = |1)(1|, then Xj = (1,0). 

Of course, Merlin may elect to be dishonest and choose not to send a proof of the above 
form to Arthur by, e.g., sending a quantum state which is entangled across the registers 
(X, Y). To catch this, our QMA protocol is defined as follows: 



1. Merlin sends Arthur a quantum state in registers (X, Y), for X and Y defined as above. 

2. Force X to be classical. Arthur measures register X in the computational basis and 
reads the measurement outcome. This forces X to essentially be a classical register of 
bits, and destroys any entanglement or correlations between X and Y. 

3. X should contain probability distributions. Arthur checks whether the content of 
registers Xj form a probability distribution pj. Arthur rejects if this is not the case. 

4. Consistency check: Can the quantum states in Y reproduce the distribu- 
tions in X? Arthur picks independently and uniformly at random, an index j G [m] 
and another index % G [r]. He applies the measurement {Uj(i)}l =l separately to each 
register Yj, . . . , Yj, and counts the number of times outcome i appears, which we denote 
henceforth as rij(i). Arthur rejects if 



k 



> -, (4.13) 
P 



for p a carefully chosen polynomial. 

Run Stage 2 of the BellQMA verification and repeat for error reduction. 

For each prover j, Arthur samples an outcome from [r] according to the distribution 
in (Xj, . . . ,Xp, and runs Stage 2 of the BellQMA verification on the resulting set of 
samples. He repeats this process independently a polynomial number of times q, and 
accepts if and only if the BellQMA procedure accepts on the majority of the runs. 
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Let us discuss the intuition behind the verification procedure above. The key is Step 4, 
where Arthur cross-checks that the classical distributions sent in X really can be obtained 
by measuring m quantum proofs, which for an honest Merlin should be unentangled. In 
this sense, our protocol can alternatively be viewed as using quantum proofs (Y) to check 
validity of a classical proof (X). Intuitively, the reason why entanglement in Y does not help 
a dishonest Merlin in Step 3 is due to the local nature of Arthur's checks /measurements. 
Finally, once Arthur is satisfied that X contains valid distributions, he runs Step 5. We 
remark that repetition is used here in order to boost the probability of acceptance in the x G 
A yes case to exponentially close to 1, which is required to separate it from the x G A no case, 
where the probability of catching a dishonest Merlin is only inverse polynomially bounded 
away from 1. Once such a gap exists, standard error reduction techniques [172, 191] (see 
Section 1.5.2) can be used to further improve completeness and soundness parameters. 

To formally analyze completeness and soundness of the QMA protocol, we assign the 
following values to the parameters, all of which are polynomial in n in our setting: 

q = 50n and p = 20mr and k = 5p 3 and a = 20nmr. (4.14) 



Completeness. Intuitively, when x G A yes , Merlin passes Step 4 with probability expo- 
nentially close to 1 since he has no incentive to cheat — he can send an unentangled proof 
in Step 1 to Arthur corresponding to the optimal proofs pj in the BellQMA protocol, such 
that the expected value of rij(i)/k is indeed Pj(i). Arthur's checks in Step 4 are then inde- 
pendent local trials, allowing a Chernoff bound to be applied. We then show that Merlin 
passes each run in Step 5 with constant probability, and applying the Chernoff bound a 
second time yields the desired completeness exponentially close to 1 for the protocol. 

To state this formally, suppose Merlin is honest and sends registers (X, Y) in the desired 
form, i.e., X* contains Pj(i) = (Hj(i), Pj) up to a bits of precision, and Yj contains pj. Then, 
the expected value of the random variable rij(i) is E[rij(i)] = k (Hj(i),pj), which is equal 



to k -Pj(i) up to the error incurred by representing pj\ 
words, 



i) using a bits of precision. In other 



E[n,-(i)] 



k 



< 



< 



2p 



We can hence upper bound the probability of rejecting in Step 3 by 

1 



Pr 



k 



> - 


< Pr 




V 







rij(i) E,[rij(i)] 



k 



k 



> 



2p 



< 2exp 



hp 

T 



(4.15) 



(4.16) 



where the first inequality follows from Equation (4.15) and the second from the Chernoff 
bound. Thus, Merlin passes Step 4 with probability exponentially close to 1. 
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We now turn to the final step. Since x G ^4 yes , we know that the optimal distributions, 
denoted qj := ((ILj(l), pj) , . . . , (Uj(r), pj)) for j G [m], obtained in Stage 1 of the original 
BellQMA protocol are now accepted in Stage 2 with probability at least 2/3. However, in 
our case, Merlin was only able to specify each qj up to a bits of precision per entry as the 
distributions pj. To analyze how this affects the probability of acceptance, let Pj and Qj 



be diagonal operators with entries Pj(i,i) = Pj(i) and Qj 



[iij(i),pj), 



respectively. 



Letting C accept denote the POVM element corresponding to outcome accept in Stage 2 of 
the BellQMA protocol, we thus bound the change in acceptance probability by: 



Tr 



Caccept [ (^) Pj — (^) Qj 



< 



'3 1 1 tr 



m r 



< 



j=i i=i 

mr 



2 20r, 



(4.17) 

(4.18) 

(4.19) 
(4.20) 



where the first inequality follows from the fact that |Tr(AB)| < || A || • || B || tr and the 
second inequality follows from Lemma 4.4. Therefore, the probability of success for each 
of the q runs of the BellQMA protocol in Step 5 is at least 



mr 



3 2 m 



> 0.6. 



(4.21) 



Since each run is independent, applying the Chernoff bound yields that Arthur accepts 
Merlin's proof in Step 5 with probability at least 1 — 2exp(— 0.02g), as desired. There may 
be some error incurred in sampling, which can be assumed to be exponentially small so 
that the success probability of each run is still at least 0.6. 



Soundness. We now prove that when x G A no , a dishonest Merlin can win with prob- 
ability at most inverse polynomially bounded away from 1. To show this, we bound the 
probability of passing Step 4 by relating the quantity Pj{i) to the expected value of rij(i)/k, 
and then apply the Markov bound. The desired relationship follows by observing first that 
the expected value of rij{i)/k is precisely the probability of obtaining outcome i when 
measuring proof j of some (honest) unentangled strategy, followed by arguing that the 
distribution pj must hence be far from this latter (honest) distribution if Merlin is to pass 
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Step 5 with probability at least 1/2 (since x G A QO ). Combining these facts, we find that 
Arthur detects a cheating Merlin with inverse polynomial probability in Step 4. 

More formally, let the quantum register Yj contain an arbitrary quantum state Oj whose 
reduced states in registers Yj for I G [k] are given by <Tj(l), and define 



k 

1=1 



(4.22) 



By the linearity of expectation, the expected value of the random variable rij{i)/k is 

k 

= rE(nj(»Vj(i)) = (n;(i).&>- (4-23) 



E 



k 



Our goal is to lower bound the expression 

rij(i) 



Pr 



k 



> 



P. 



(4.24) 



To achieve this, we first substitute Pj(i) above with a quantity involving E[rij(i)/k], and 
then apply the Markov bound. 

To relate K[rij(i)/k] to Pj(i), we first remark that in order for Merlin to pass each run 
of Step 5 with probability exponentially close to 1, he must send probability distributions 
Pj, which are accepted by Stage 2 of the BellQMA verification with probability at least 
1/2. Let 

(&(<):= (n^i),^). (4.25) 

Let us imagine a BellQMA protocol where the j-ih Merlin sends as his quantum proof. 
Since x G A no , by the soundness property of the BellQMA(m) proof system, the success 
probability of the Merlins is at most 1/3. In other words, sampling outcomes from the 
probability distributions (qj(l), . . . ,qj(r)) and then running the second stage of the Bel- 
lQMA verification will yield outcome accept with probability at most 1/3. Also, observe 
that 



E 



nj\i) 



k 



(4.26) 



It follows that by letting Pj and Qj be diagonal operators with the probability vectors pj 
and qj on their diagonals, respectively, and C accep t the POVM element corresponding to 
outcome accept in Stage 2 of the BellQMA protocol, we have 



10 



< 



Tr 
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3 1 1 tr • 



(4.27) 
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Here, the (loose) lower bound of 1/10 comes from the following two observations. First, the 
distributions represented by the Q/s are derived from a BellQMA protocol and therefore 
achieve a success probability at most 1/3 by the soundness property of the BellQMA 
verification. Second, the distributions represented by the P/s have to achieve a success 
probability strictly greater than 1/2 per run to guarantee that Merlin wins Step 5 with 
probability exponentially close to 1. Combining these two, we get that the difference 
between the success probabilities obtained by distributions {Pj} and {Qj} should be at 
least 1/6 modulo the error incurred due to finite precision when encoding the distributions 
Pj. The use of the constant 1/10 overcompensates for this precision error. Hence, there 
exists a j such that 

II Pj ~ Qj L = E IPiW - ^ 7^ZT> ( 4 -28) 

i=i 

implying the existence of an i such that 



10m' 



1 



lOmr 



(4.29) 



This is our desired relationship between Pj(i) and E,[rij(i)/k} = qj(i). Note that the prob- 
ability of picking pair in Step 4 is 1/mr. 

We now substitute this relationship into Equation (4.24) and apply the Markov bound. 
Specifically, choose i and j as in Equation (4.29), and assume that Pj(i) > (Hj(i),£j). 
Then, we have 



Pr 



< 



p. 



< Pr 



— E 



nj[i) 



> 
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lOmr p 



< 1 



1 

2p" 



(4.30) 



The case of Pj(i) < (Tlj(i),£j) is similar. We conclude that a dishonest Merlin is caught in 
Step 4 with probability at least l/2p. Therefore, the probability that Arthur proceeds to 
Step 5 is upper bounded by 



1 

mr 



20mr 



1 

mr 



D = l 



20m 2 r 2 ' 



(4.31) 



where the first term represents the case where Arthur selects the correct pair to 
check, and the second term the complementary case, in which we assume the cheating 
prover can win with probability 1. Hence the overall success probability of Merlin is at 
most 1 — l/20m 2 r 2 . 

Finally, as mentioned before, since m and r are polynomially bounded functions, we 
have that the completeness is exponentially close to 1, while the soundness is bounded away 
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from 1 by an inverse polynomial. By known error reduction techniques for QMA proto- 
cols [172, 191], one can amplify the completeness and soundness errors to be exponentially 
close to 0. This proves our desired containment. 

4.5 Perfect parallel repetition for SepQMA(poly) 

Using cone programming, we now show Theorem 4.3, i.e., that the class SepQMA(m) 
admits perfect parallel repetition. Recall now that for C the measurement operator cor- 
responding to outcome accept, the maximum success probability of the Merlins in any 
QMA(m) protocol can be written as the maximum of (p, C), where p is a density operator 
in the cone Sep (Xx, . . . , X m ). This is a simple cone program and can be written as the 
following primal-dual pair: 

Primal problem (P) Dual problem (D) 

max (p, C) min t 

s. t. Tr(p) = 1, s. t. tI x = C + W, 

p G Sep (Xx, ... , X m ) , W e Sep(;fi, . . . , X m )*, 

where X denotes Xx <g) • • - <S> X m , and Sep(Xx, . . . , X m )* is the dual cone defined as 

Sep^!, ■ ■ ■ , Xm)* ■= {W : (p, W) > for all p e Sep (X x , . . . , X m )} . (4.32) 

(Note that Sep(Ai, . . . ,X m )* contains the set of entanglement witnesses in the theory of 
entanglement, see [151].) Moreover, the use of "maximum" and "minimum" is justified 
in the above programs since p = di ^ x ~ ) and (t, W) = (2, 2I X — C) are strictly feasible 
solutions for (P) and (D), respectively [124, 125, 126] (i.e. strong duality (Theorem 4.8) 
holds). 

Given two protocols, the corresponding cone programs are completely specified by 
Arthur's POVM corresponding to outcome accept and the underlying cone: 

(Cx, Sep (Xx,..., X m )) and (C 2 , Sep (y u . . . , y m )), (4.33) 

while the parallel repetition protocol is specified by (Cx ®C%, Sep (Xx <8> 3^i, • • • , X m (g) y m )). 

To show Theorem 4.3, note first that if px and P2 are optimal solutions of the primal 
problems associated with the two individual protocols, then px <S> P2 is a feasible solution of 
the primal problem associated with the parallel repetition protocol. Therefore the success 
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probability of the parallel repetition is at least the product of the success probabilities of 
the individual protocols. We now show that no other strategy for the prover can perform 
better than this honest strategy. To do so, we demonstrate a feasible solution for the dual 
problem associated with the parallel repetition protocol attaining the same objective value. 

More formally, let (ti, W\) and (t 2 , W 2 ) be respective dual optimal solutions correspond- 
ing to two protocols. We show that (ti ■ t 2 , W) is a dual feasible solution corresponding to 
the two-fold repetition of protocols for some choice of W G Sep(Ai <g> y±, • • • , X m <g> y m )* . 
To do so, we first require the following lemma. 

Lemma 4.9. For complex Euclidean spaces X±, . . . , X m , y±, . . . , y m : 

• Sepf^i, . . . , X m )* ® Sep(3^i, . . . , y m ) C Sep(A\ ®yi,...,X m ® y m )* , and 

• Sep(^i, ...,X m )(g) Sep(^i, . . . , y m T C Sep(A' 1 ® y u . . . , X m <g> y m )* . 

Proof. We prove the first condition as the second is similar. Fix W G Sep(A\, . . . ,X m )* 
and C G Sep^i, . . . , y m ). Then for S G Sep(X 1 <g> y u . . . , X m <g> y m ), we have 

(W ®C,S) = (W, Try [S{I X <g) C)]) > 0, (4.34) 

if Try [S(I X ® C)] G Sep(<*i, ...,X m ). To this end, let 

km k' m 

s = J2 ( S>p^ and c = £(8K(0, ( 4 - 35 ) 

i=l 1=1 j=l 1=1 

where pi(l) G Pos (X t ® y t ) and a^l) G Pos(^) for all % G [k], j G [k'\, and I G [to]. Now 
we can write Try [S (I x ® C)} as 



Tr 



y 



era \ / k' m 

,i=i «=i / V i=i /=i 



(4.36) 



k k m 

i=l j = l /=1 



Hence, Try [S 1 (i* <8> C)] G Sep (A*!, . . . , X m ) since Tr^ [pi(Z) <8> ^(0)1 * s positive semidef- 
inite for all i, j, /. The latter follows since for positive semidefinite A X(gl y and By, 

Tr y (A x ® y I x ® By) = Tr y (I x ® B*A X QyI x ® B*) h 0, (4.37) 
which follows since C'DC >z if D >z 0. This concludes the proof. □ 
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We use Lemma 4.9 to construct two operators in Sep(Ai ® 3^, • • • , X m ® 3^m)*> the appro- 
priate convex combination of which is the dual feasible solution we are seeking. Specifi- 
cally, observe first that since for the two instances of the SepQMA(m) protocol, we have 
C\ G Sep (Xi, . . . , X m ) and C 2 G Sep . . . , y m ), and since 1% and Iy are fully separable 
operators, it follows that 

s x I x + G x G Sep(^!, • • • , X m ) and s 2 I y + C 2 e Sep(^i, . . . , y m ) (4.38) 

for all s\,s 2 > 0. Using Lemma 4.9, we thus obtain operators 

{tj x - Ci) ® (tjj/y + C 2 ) G Sep(A' 1 <g) tt, . . . , X m <g> ^ m )* (4.39) 

and 

(til* + d) ® (t 2 I y - C 2 ) G Sep(^ ®y u ...,X m ® y m Y. (4.40) 

Here we have used the fact that W = t\lx — C\ G Sep(Ai, . . . , X m )* since (ti, W) is by 
assumption the optimal dual solution for the first protocol (and similarly for the second 
protocol). Since Sep(Xi Cg> y±, . . . , X m eg) y m )* is a convex cone, it follows that the average 
of Equations (4.39) and (4.40) yields the desired operator 

W:=h- t 2 I x ®y -C x ®C 2 e Sep(*! <g> y u . . . , X m <g> y ro )*. (4.41) 

We conclude that (ti • t 2 , W) is a feasible solution of the dual problem associated with 
parallel repetition of protocols with objective value ti ■ t 2 as desired. This concludes the 
proof of Theorem 4.3. 
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Chapter 5 



Signatures of non-classicality in 
mixed-state quantum computation 

This chapter is based on [78]: 

A. Datta and S. Gharibian. Signatures of nonclassicality in mixed-state quantum compu- 
tation. Physical Review A, 79:042325, 2009, DOI: 10.1103/PhysRevA.79.042325, © 2009 
American Physical Society, pra.aps.org. 

In this chapter, we investigate signatures of non-classicality in quantum states, in partic- 
ular, those involved in the DQC1 model of mixed-state quantum computation [11 ]. To 
do so, we consider two known non-classicality criteria. The first quantifies disturbance 
of a quantum state under locally noneffective unitary operations (LNU), which are local 
unitaries acting invariantly on a subsystem. The second quantifies measurement induced 
disturbance (MID) in the eigenbasis of the reduced density matrices. We study the role of 
both figures of non-classicality in the exponential speedup of the DQC1 model and com- 
pare them vis-a-vis the interpretation provided in terms of quantum discord. In particular, 
we prove that a non-zero quantum discord implies a non-zero shift under LNUs. We also 
use the MID measure to study the locking of classical correlations [87] using two mutually 
unbiased bases (MUB). We find the MID measure to exactly correspond to the number of 
locked bits of correlation. 
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5.1 Introduction and results 



A thorough understanding of classical and quantum correlations underlies their success- 
ful exploitation in quantum information science. Characterizing the relative roles and 
abilities of these two forms of correlations in performing specific computational and infor- 
mation processing tasks would be a valuable advance in the field. Substantial progress in 
this direction has already been achieved. The role of entangled states in quantum informa- 
tion processing and computing is quite well studied. Jozsa and Linden [160] showed that 
multipartite entanglement must grow unboundedly with the problem size if a pure-state 
quantum computation is to attain an exponential speedup over its classical counterpart. In 
the context of information processing, Masanes has shown [192] that all bipartite entangled 
states can enhance the teleporting power of some other state. In spite of these successes, 
there are instances of quantum computations where the quantum advantage cannot be 
attributed to entanglement. Meyer has presented a quantum search algorithm that uses 
no entanglement [194]. Instances are also known of oracle based problems that can be 
solved without entanglement, yet with certain advantages over the best known classical 
algorithms [17, 166]. 

Given this scenario, it becomes a logical necessity to study the essentialness of en- 
tanglement in quantum information science. A realistic motivation is that provided by 
mixed-state quantum computation. Pure states in a quantum computation inevitably get 
mixed due to decoherence. One way to address this issue would be to study the prospects 
of quantum computational speedup with mixed states themselves [25]. NMR quantum 
computation provides a good scenario for this. As a simplified model for this, Knill and 
Laflamme proposed the DQC1 or the 'power of one qubit' model [174]. Though not be- 
lieved to be as powerful as a pure-state quantum computer, it is believed to provide an 
exponential speedup over the best known classical algorithm for estimating the normalized 
trace of a unitary matrix. The DQC1 model was found to have a limited amount of (bipar- 
tite) entanglement that does not increase with the system size. Additionally, for certain 
parameter settings, there is no distillable entanglement present whatsoever, and yet the 
model retains its exponential advantage. In this latter case the state has a positive partial 
transpose, and thus possesses, at most, just bound entanglement [77]. Looking for a more 
satisfactory explanation for the exponential speedup, the quantum discord [203, 138] was 
calculated, of which the amount found was a constant fraction of the maximum possible 
[80], regardless of the parameter settings for the model. In this chapter, we study two 
alternative methods of studying the quantum behavior of quantum computational and 
information tasks. 
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Our results: This chapter studies the non-classical correlations found in the DQC1 states 
for trace estimation, as well as those used in the locking of classical correlations [ ], with 
respect to two quantification schemes abbreviated as LNU and MID. 

1. Locally noneffective unitaries (LNU). Locally noneffective unitary operations 
(LNU) have previously been studied with the aim of developing an entanglement detection 
criterion [102, 107] (see Section for a definition 5.2). Here, we study whether LNU can be 
used to quantify non-classicality, motivated by the disturbance of a quantum state under 
unitary operations. Specifically, we employ LNU in analyzing the DQC1 model, which 
has previously been studied using the quantum discord. Thus, we compare these two 
certificates of non-classicality, with the aim of contrasting disturbance under measurement 
with disturbance under unitary operations. We also study a mixed-state task in the setting 
of quantum communication known as locking [87], which uses two mutually unbiased bases 
(MUB) to lock classical correlations in a quantum state. For both tasks, we find that LNU 
do not indicate a high level of correlations. 

2. Measurement-Induced Disturbance (MID). We then study the DQC1 model using 
the Measurement-Induced Disturbance (MID) measure [185] in Section 5.5. Regarding the 
MID measure, in Reference [185], a preliminary analysis of the DQC1 model was begun. 
Here, we extend this analysis to the entire parameter range for the DQC1 model, including 
those which limit the DQC1 state to being at most bound entangled. This latter case is 
of particular interest due to the lack of distillable entanglement. We also study the task of 
locking. For the latter, the value of the MID measure is exactly the number of locked bits 
of correlation in the state. 

Discussion. With regards to the LNU distance, we find (Equation (5.11)) that there is 
little non-classicality in the n+1 qubit DQC1 state. This behavior is very similar to that of 
negativity [241] in the DQC1 model which was used to characterize its entanglement [77]. 
The crucial difference is that the bipartite split chosen in Section 5.3 is separable, and 
therefore exhibits no entanglement at all. As the LNU distance vanishes exponentially 
quickly with growing n, one is hard-pressed to relegate the role of the resource exponentially 
speeding up the DQC1 model to it. Similarly, the LNU distance suggests vanishing non- 
classicality in the case of locking of classical correlations in quantum states. 

We find the MID measure, on the other hand, to be considerably more satisfactory. 
The zero-entanglement split in the DQC1 model is shown to have a non-zero amount of 
non-classicality as per the MID measure. The magnitude of this measure, as shown in 
Figure (5.4), is a constant fraction of its maximum possible value. Further, the MID 
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measure performs well in quantifying non-classicality in the scenario of locking classical 
correlations in quantum states. Further studies in this direction are required before a 
comprehensive conclusion can be reached. 



Organization of chapter. We begin in Section 5.2 by defining LNU. Section 5.3 studies 
LNU in the DQC1 model. Section 5.4 shows a one-way relationship between LNU and the 
quantum discord. In Section 5.5, we define the MID measure, and use it to study the DQC1 
model in Section 5.5.1. In Section 5.5.2, we study both MID and LNU in the context of 
locking. 

Notation. Throughout this chapter, for T>(A <8> B) we denote the dimensions of A and 
B as M and N, respectively. All designations of a density matrix without any subscripts 
refers to a bipartite state. For example, p stands for pab- 



5.2 Locally noneffective unitary (LNU) operations 

We begin by introducing locally noneffective unitary operations (LNU), first proposed 
under the name local cyclic operations [102]. For this, consider a bipartite quantum state 
p G V(A<S>B), shared between A and B such that pA = Tre(p) and ps = Tr^(p). Suppose 
now that Alice performs a local unitary Ua that does not change her subsystem, that is, 
Pa = UaPaU\, or equivalent ly 

[p A ,U A ] = 0. (5.1) 

This action can, however, affect the state of the total system, such that if we define 
Pf := (Ua <8> Ib)p(Ua <8> Ib)\ it is possible that p ^ pf. Unitaries satisfying Equation (5.1) 
are called LNU [102]. To quantify the difference between p and pf, we use 

d m ax(p) := max —= || p - p f \\ F = max J^{p 2 ) - Tr(pp f ), (5.2) 

Ua: y 2 U A ■■ v 

[PA,U A ]=0 [PA,U A }=0 



where || A || F = ^Tv(A^A) denotes the Frobenius norm. From the latter expression, it is 
clear that < d max (p) < 1. 

For any product state p pro d '■= Pa® Pb, d max (p pr0( i) = 0. Closed form expressions for 
dmax(p) are known for (pseudo)pure states and Werner states [107]. As with the quantum 
discord, it is possible to have d max (p sep ) > for certain separable states, implying d max (p) 
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is not a non-locality measure. Recall that a separable state p sep G T>(A <8> <6) is defined as 
one of the form 

Psep ■= ^2pk\a k )(a k \ <g> \h)(b k \, (5.3) 

k 

where ^2 k p k = 1, and the |a/-) G *4 and |&/-) G £> are vectors of Euclidean norm 1. For 
two-qubit separable states, the maximum LNU distance attainable is [102] 



dmax(Psep) ^ 



y/2' 



(5.4) 



As an illustration, the maximum LNU distance for the two-qubit isotropic state, 

1 - z 



Pi. 



4 



-h + z\S>)(% ze [0,1] 



(5.5) 



where = (|00) + |11))/V2, is given by d max (pj SO ) = z [107]. By Equation (5.4), we 
can conclude that the two-qubit isotropic state is entangled for z > 1/a/2. The partial 
transpose test, which in this case is necessary and sufficient, shows that this state is actually 
entangled for all z > 1/3, showing that the LNU distance is weaker at detecting entangled 
states than the former. 

We remark that we have restricted our attention here to the case where the LNU is 
applied to subsystem A of p. Let us derive a simple upper bound on d max (p) which holds 
regardless of which target subsystem we choose, and which proves useful throughout this 
chapter. 



Theorem 5.1. For any p G V(A <8> £>), 



d max (p) < J 2 Tr(p 2 )- 



MN 



(5.6) 



Proof. Since || p — jj-^ || is invariant under unitary operations, we have via the triangle 
inequality that: 



P-Pfh 



< 



MN 



MN 



Pf 



MN 



2A Tr(p2 



1 



MN 



Substituting this expression in Equation (5.2) gives the desired result. 



□ 



Thus, if the purity of a state p strictly decreases as a function of the dimension, then 
dmax(p) -> as MN ->■ oo. 
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Figure 5.1: The DQC1 circuit 



5.3 LNU in the DQC1 model 

We now study the non-classical features of the DQC1 model of quantum computation, as 
quantified by d max (p). The n + 1 qubit DQC1 state for given unitary U n G U(B® n ), as 
demonstrated in Figure (5.1), is given by [77] 

"><*» - ^ ( X t" ) • (57 > 

We will consider the top qubit to be system A on which our local unitary acts and the 
remaining n qubits as system B. The reduced state is then 

PA = Tr B (p DQC1 )= 1 -^ r a [ ) (5.8) 

with r = Tr(f/„)/2 n . For an arbitrary SU(2) unitary Ua acting on A, which we characterize 
as 

e*^ cos 9 e %x sin i 



Ua ~[ -e-^sinfl eocene ) ' (5 ' 9) 

the LNU condition of Equation (5.1) requires that X = \ ~ ar g( r ) an d either cf) = or 
6 = 7r/2. Both cases lead to the same final expression, so set = 0. Via Equation (5.2) 
and simple algebra, we hence have 



a sm6 Re(Tr(e- 2 -^^ n 2 )) 
d(pDQCi,0) = ^TTWV 2^ ' ( 5 0) 



The now trivial maximization over all 9 gives 



a / Re(Tr(e _2iar s T £/ 2 )) a 
c(Pdqci) = 2(n+1)/2 y 1 ^ ~-2^2' ^• U ' 
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Here, we have used the rough estimate Re(Tr(e 2?argT L^)) > —2™. For a two-qubit pure state 
(n = 1, a = 1), we thus have d max (/?DQCi) < l/y/2, which conforms with Equation (5.4). 
A typical instance of the DQC1 circuit is provided by that of a random unitary U n in 
the DQC1 circuit of Figure (5.1). For such instances of large enough Haar distributed 
unitaries, Tr([/ 2 ) is bounded above by a constant with high probability | ]. Thus, the 
second term inside the square root in Equation (5.11) is approximately zero, and 

d max (P J DQC*l) ~ 2 (n+l)/2' ( 5 - 12 ) 

This shows that the DQC1 state experiences very little disturbance under LNU, and 
in fact this disturbance vanishes asymptotically as n grows. As discussed in the introduc- 
tion, it would appear that the quantum discord is better suited [< ] to quantifying non- 
classicality in the DQC1 model. This, however, raises the question of how the discord and 
LNU distance are related, and whether the paradigms of 'disturbance under measurement' 
and 'disturbance under unitary operations' lead to differing notions of non-classicality. We 
explore these questions in the following section. 

Before closing, for completeness, we invoke Theorem (5.1) to show that the LNU dis- 
tance is exponentially decreasing for any other choice of bi-partitions A and B of the qubits 
in Pdqcx- 111 fact, since 

MpIqci) = (5-13) 
Theorem (5.1) immediately gives the same upper bound of Equation (5.11). 



5.4 Quantum discord vs LNU distance 

Motivated by the fact that both the quantum discord and the LNU distance are aimed 
at capturing the non-classical features in a quantum state via an induced disturbance, we 
seek an answer to the question of whether one implies the other in any sense or not. Here, 
we show that non-zero quantum discord implies a non-zero LNU distance, but that the 
converse is not necessarily true. We begin by recalling the definition of quantum discord. 

Given a quantum state p G V(A <g> B), recall from Section 1.6.2 that the quantum 
discord [ 13] is defined as 

8{p) := S(p A ) -S(p)+ mm S (pb|{ii/}) ( 5 - 14 ) 
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for {nj 1 } a rank-one projective measurement, and for 

s (aiki) := 5> 5 ( (n ^ ® /B)p(n ^ ® /B) A) ' (5 - 15) 

i 

where pj = Tr(n^ ® I B p). Intuitively, quantum discord captures purely quantum correla- 
tions in a quantum state. This is distinct from entanglement in the case of mixed states. 
For pure states, quantum discord reduces to the von Neumann entropy of the reduced 
density matrix, which is a measure of entanglement. On the other hand, it is possible for 
mixed separable states to have non-zero quantum discord. The main theorem concerning 
the discord that we require here is the following. 

Theorem 5.2 (Ollivier and Zurek [: ]). For p £ T>(A <8> B), S(p) = if and only if 
p = ^2j(Hf <8> I B )p(Tlf <g) I B ), for some complete set of rank one projectors {^f} ■ 

We now show the following. 
Theorem 5.3. For p E V(A <g> B), if 8(p) > 0, then d max (p) > 0. 

Proof. We begin by writing p in Fano form [91], i.e. 

M 2 -l N 2 -l 

>T A st> T B + r A ■ ct A 6b T B + T A 6d r B ■ rr B 4- 

MN 



1 

p = —(I A ® I B + r A ■ cr A ® I B + I A ®r B ■ ct b + E T «rf ® °t )• ( 5 - 16 ) 



=i t=i 



Here, cr A denotes a (M 2 — l)-component vector of traceless orthogonal Hermitian basis 
elements (which generalize the Pauli spin operators), r A is the (M 2 — l)-dimensional Bloch 
vector for subsystem A with r A = yTr^^a^ 1 ), and T is a real matrix known as the 
correlation matrix with entries T st = -jtTr(a A ® o~ B p). The definitions for subsystem B 
are analogous. 

An explicit construction for the basis elements G{ for M > 2 is given as follows [139]. 
Define {ci}^ 1 = {U pq , V pq , W r }, such that for 1 < p < q < M and 1 < r < M — 1, and 
{|&)}fe=i some complete orthonormal basis for A: 

U m = \p)(q\ + \q)( P \ (5.17) 
V pq = -i\p)(q\+i\q)(p\ (5.18) 

W r = J^q^j f^|*><*|-r|r + l><r + l|). (5.19) 
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In our ensuing discussion, without loss of generality, we fix the choice of basis {|/c)} fc=1 
above as the eigenbasis of pa- (Note that the set of orthonormal eigenvectors of pA will not 
be unique if the eigenvalues of pa are degenerate. Hence, we fix some choice of eigenbasis 
for pa as the "canonical" choice to be referred to throughout the rest of our discussion.) 

Assume now that 5(p) > 0. Then, any choice of complete measurement {Rf} must 
disturb p, i.e. by Theorem 5.2, if we define 

M 

P/ :=^(n/®/)p(n/®/), (5.20) 

3=1 

then pf ^ p. Henceforth, when we discuss the action of {Rf} on pa, we are referring 
to the state YlfjLi ^-fp^-f- Now, let \J[f} be a complete projective measurement onto 
the eigenbasis of pa- Then, {H- 1 } acts invariantly on pa, and thus must alter the last 
term in Equation (5.16) to ensure pj ^ p. To see this, recall that one can write pA = 
jj(I a + r A ■ cr A ), from which it follows that if {H- 1 } acts invariantly on pa, then it also 
acts invariantly on r A ■ a A from Equation (5.16). Since all basis elements o A e {H / r } r are 
diagonal, it follows that there must exist some T st ^ such that o A G {U pq , V pq } . We 
now use this fact to construct a LNU U A achieving d(p, Ua) > 0. 

Define unitary U A as diagonal in the eigenbasis of pa, i-e. U A = Yjh=i e ldk \k)(k\, with 
eigenvalues to be chosen as needed. Then, [U A , pa] = by construction, and so U A Cg> I B 
must alter T through its action on p to ensure pf ^ p. Focusing on the last term from 
Equation (5.16), we thus have: 

RI 2 -lN 2 -l M 2 -lN 2 -l / M M \ 

Y, T - uA ^U A] ®<yf= E EE eiM H^|n>|m)(n|Uaf 

5=1 t=l s=l t=l \m=ln=l / 

Analyzing each af case by case, we find, for some l<p<g<Mor 1 < r < M — 1: 

@q)Vpq ^ CTs = U pg 

e q )V vq iia s = V m (5.21) 
if <7 S = W r . 



M M 



COS 



PQ 



sm 



^^e i{ - 6m ~ en) (m\a s \n)\m)(n\ = I sm(9 p - 6 q )U pq + cos(# p 



m=l n=l 
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Denoting by T* the T matrix for pf, we have: 



1 st 



cos(9 p - 6 q )T st + sin(0 p - 6 q )T wt 

if a s = Upg, where a w = V pq 
cos(6 p - 9 q )T st - sm(9 p - 9 q )T wt (5.22) 

if a s = Vpq, where a w = U pq 
^T st if a s = W r . 

Thus, if there exists an s such that T st ^ and af G {U pq , V pq } pq , it follows that one 
can easily choose appropriate eigenvalues e t9p and e t9q for U A such that T* ^ T, implying 
dmax(p) > 0. By our argument above for 6(p) > 0, such an s does in fact exist. □ 

To show that the converse of Theorem 5.3 does not hold, we present an example of a 
zero discord state that has non-zero LNU measure. Consider the two qubit separable state 

1 [I 2 + a.a I 2 + b.a I 2 - a.o I 2 - b.a\ 

^ = 2 C 2 ® 2 + 2 ® 2 J ' (5 ' 23) 

where || a || 2 = || b || 2 = 1. This state, by construction, has zero discord for a single qubit 
measurement on either A or B. To see this, consider the projective measurements 



Io ± a. cr 



(5.24) 



on A. Let us now study the LNU distance for this state, with the local unitary being 
applied to say A. Notice that Pa = Pb = h/2, and Tr(p 2 ) = 1/2. The former implies 
that any local unitary on A can be chosen, as characterized by Equation (5.9). Let us for 
convenience parameterize a = (0,0,1) and b = (sin 7 cos 5, sin 7 sin 5, cos 7). Then, some 
algebra leads to 

Tr(p P/ ) = icos 2 fl, (5.25) 



whose minimum is 0, whereby 



d max (p) = -L. (5.26) 



We thus have an example of a class of separable, zero discord states which demonstrates 
a non-zero shift under LNU. In fact, it attains the maximum shift possible for two-qubit 
separable states. Hence, if one wishes to define notions of non-classicality in quantum states 
in terms of 'disturbance under measurement' versus 'disturbance under unitary operations', 
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and one chooses discord and the LNU distance as canonical quantifiers of such effects, 
respectively, then the resulting respective notions of non-classicality are not equivalent. 
As we have shown in Theorem 5.3, however, the quantum discord is a stronger notion of 
non-classicality than the LNU criterion. 

5.5 Measuring correlations via measurement-induced 
disturbance 

The measure we intend to use in this section was presented by Luo in [185]. It relies on 
the disturbance of a quantum system under a generic measurement. In that sense, it is 
similar in spirit to quantum discord, but not quite. In the case of quantum discord, as per 
Equation (5.14), one maximizes over one- dimensional projective measurements on one of 
the subsystems. For the measure used here, which we will call the Measurement-Induced 
Disturbance (MID) measure, one performs measurements on both the subsystems, with the 
measurements being given by projectors onto the eigenvectors of the reduced subsystems. 
Then the MID measure of quantum correlations for a quantum state p G T>(A®B) is given 
by [185] 

M(p):=I(p)-I(V(p)) (5.27) 

where 

M N 

i=i j=i 

Here {11^}, {IL^} denote rank one projections onto the eigenbases of pA and ps, respec- 
tively, and X(cr) is the quantum mutual information. The measurement induced by the 
local eigenvectors leaves the entropy of the reduced states invariant and is, in a certain 
sense, the least disturbing. Actually, this choice of measurement even leaves the reduced 
states invariant [185]. Interestingly, for pure states, both the quantum discord and the 
MID measure reduce to the von Neumann entropy of the reduced density matrix, which 
is a measure of bipartite entanglement. An advantage of the MID measure is that since 
no optimizations are involved, it is much easier to calculate in practice than the quantum 
discord or the LNU distance, which involve optimizations over projective measurements 
and local unitaries respectively. The corresponding disadvantage is that if the spectrum 
of either p^ or ps is degenerate, there exist examples [259] where the MID measure is not 
necessarily well-defined, as the choice of local eigenbases is no longer unique. In this case, 
the value of the MID measure should be interpreted moreso as a rough estimate or upper 
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Figure 5.2: (Color online) The solid line is the MID measure Ai for the 2x4 Horodecki 
state from [147]. The dashed line is the quantum discord T> for the same state [79]. The 
kink in the latter curve occurs at p — 1/7. We see here, as in the case of the DQC1 state, 
that the MID measure is greater than or equal to the quantum discord. 



bound on the non-classicality of a state. We remark that for this reason, it may be more 
reasonable to consider a quantity 

M*(p) := X{p) - max X{V{p)\ (5.29) 
{nf},{nf} 

where {n^},{IT^} are again projections onto eigenbases of pa and pb, respectively. (A 
quantity similar to AA*(p) was considered in [259], except the maximization there is over 
all local POVMs. Also, note that it follows directly from the definition of Ai*(p) that it 
is an upper bound on the distillable entanglement potential of p introduced in Chapter 7 
(Equation (7.16)).) Computing Ai*(p) is naturally much more difficult; we discuss A4*(p) 
in this section where appropriate in addition to our discussion of Ai(p). 

To demonstrate the MID measure on a non-trivial example, we first consider the well- 
known Horodecki bound entangled state in 2 <g> 4 dimensions [ ] . It is bound entangled 
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for all values of < p < 1, and the state is given as 
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(5.30) 



The projectors onto the eigenvectors of the reduced density matrices can be chosen as 



{itf.itf} 
{nf,- ,nf} 



and 



(5.31) 
(5.32) 



where 1^) = (|1) ± |2))/V2 and |$ ± ) = (|0) ± |3))/V2, with {|0), |1), |2), |3)} forming the 
computational basis for the second subsystem. Using these in Equation (5.28), we have 



{l^+X^+l, |*->(*-|, |$ + )(* + |, |$-)($^|}. 
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(5.33) 



Note that this density matrix differs from pn in that some of the off-diagonal terms p have 
vanished. We have computed the MID measure for p# as Ai(pn) = S(V(ph)) — S(.Ph) 
and plotted it in Figure (5.2). In the same figure, we also plot the quantum discord for 
this state, when a measurement is made on the two-dimensional subsystem [ ]. As we 
see, there are non-classical correlations in this state that are not distillable into maximally 
entangled Bell pairs. 

As a comparison, we remark that for pn, M*{ph) behaves similarly to M.{pu)- To see 
this, note that only ps has a degenerate eigenvalue, and this is on the space spanned by 
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Figure 5.3: A plot of M* for the 2x4 Horodecki state from [147]. 



and ITf . Thus, in the minimization over local bases, one can more generally choose ITf 
and nf to project onto an arbitrary basis for this space, a\0) + e ie b\3) and 6|0) — e a|3) 
for a, b, 9 6 M, respectively. The eigenvalues of V(ph) are then (up to normalization) 

- (p + 1 ± V^-P 2 ) ,P,P,P (l ± |a| |6| VK 1 - cos(25))) ,p (l ± \a\ \b\ y/2(l - cos(20))) . 

(5.34) 

In the expression M*[p H ) = mm^ n Ay ^ n By S(V(ph)) — S(ph), the entropy S(V(ph)) is 
thus minimized by choosing a = b = l/y/2 and 9 = n/2. A plot of the resulting value of 
A4*(pn) is given in Figure 5.3. 

5.5.1 MID measure in the DQC1 model 

We now move on to calculate the MID measure in the DQC1 model. Our analysis extends 
that of [185], where only the case of a = 1 was considered. Considering a < 1/2 here will 
be of particular interest, due to the lack of distillable entanglement in the DQC1 state (in 
this regime, any bipartite split has a positive partial transpose). Consequently, we start 
with the (n + l)-qubit DQC1 state, given by Equation (5.7), wherefrom 

PA = \{l T °f ) and p B = I n /2 n , (5.35) 
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where recall r = Tr(£7 n )/2 n . The projectors onto p^'s eigenvectors can be chosen as 

{n^nf} = {|0 o )(0ol,l0i)(0i|} (5.36) 

for |0 O ) := (|0) +e**\l))/y/2 and |0i) := (|0) - e**\l))/y/2, respectively, where r = re** for 

If } = {|i)0'|} for {\j)}? = 



\t\. Similarly, set {Uf } = {\j)(j\} for {|j)}^ =1 the computational basis. Using this, 



we can calculate 

2™ 1 

v(p DQC i) = ^^(n^nf) PDQC1 (n^nf) (5.37) 

3=1 k=0 

2 n 1 

- ^iEE^ a( ^„ u> «)n^W0l. Ms) 

Observing that 

n£ ( a{m ^ j) ) itf = (l + (-l) k aRe((j\U n \j)e^)) (5.39) 
we conclude that the spectrum of V(pdqci) is given by 

MV(p DQ ci)] = {^ L } (5.40) 

for 

Aj := a Re ((j\U n \j)e~ i<l> ) (5.41) 

and j E [2 n ]. Letting \ k denote the fcth entry of \[P(pdqci)}, the von Neumann entropy 
of this state is 

2 n + l 

S{V{ PDQ ci)) = -^A fc log(A fc ) (5.42) 

k=i 

= » + 1 -^l5:("'s(l-Aj) + A J lo g (i±|i)j. (5.43) 

Now, 

S(pdqci) =n + H 2 (^-) , (5.44) 
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and since the entropies of the partial density matrices are invariant under the local mea- 
surements, we have 




M DQ ci = l{p DQ ci)-I{V{p DQC i)) (5.45) 
= S(V(p DQ ci)) - S( PDQC1 ) (5.46) 

= i - H, (i^) - ^ yj J log(1 _ A?) + A . log 

For any unitary U n , which is known in any implementation of the DQC1 circuit, the 
above quantity can be computed easily. Bounding this quantity more generally, however, 
is difficult. If, however, in the asymptotic limit of large n, |Aj| — > (as might intuitively 
be expected when U n is a Haar distributed random unitary matrix, since then we might 
expect \ujj\ ~ l/2 n / 2 ), then the whole quantity within the summation in Equation (5.45) 
goes to zero. In this case, 

M DQ ci ~ 1 - H 2 O^^j • ( 5 - 47 ) 

One fact immediately notable is that the above expression for the MID measure is indepen- 
dent of n, for large n. The result for an = 5 qubit Haar distributed random unitary matrix 
is shown in Figure (5.4). As is evident, despite the approximations used in the derivation of 
Equation (5.47) the asymptotic analytic expression matches the numerical result at n = 5 
quite well. We remark that even for a version of Ai dqci where one minimizes over all local 
POVMs, the behavior one finds is quantitatively analogous to that of M.dqci plotted in 
Figure 5.4 [259]. 

The MID measure for the DQCI state across the bipartite split separating the top qubit 
from the rest is non-zero for all non-zero values of a. Across this split, the DQCI state 
is strictly separable [ ] and possesses no entanglement. Hence, one might propose the 
MID measure as a quantifier of the resource behind the quantum advantage in the DQCI 
model [185]. Note that, as can be seen from Figure (5.4), the behavior of the MID measure 
is qualitatively quite similar to that of the quantum discord. 



5.5.2 Non-classical correlations in quantum communication 

We now use the MID measure to study the locking of classical correlations in quantum 
states. It has been shown [ ] that there exist bipartite quantum states which contain a 
large amount of locked classical correlation which can be unlocked by a small amount of 
classical communication. More precisely, there exist (2n + l)-qubit states for which the 
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0.2 0.4 0.6 0.8 1 



Figure 5.4: (Color online) The upper solid (cyan) line is the MID measure M. (Equa- 
tion (5.45)) for the DQC1 circuit for a n = 5 qubit Haar distributed random unitary 
matrix. The upper dashed (blue) line is the analytic expression for the MID measure for 
certain DQC1 states from Equation (5.47). The lower dashed (red) line shows the discord 
T> in the DQC1 circuit with the same unitary. The lower solid (green) line shows the ana- 
lytical expression of the quantum discord from [80]. All quantities are shown as functions 
of the purity a of the control qubit. 



optimal classical mutual information between measurement results on the subsystems can 
be increased from n/2 bits to n bits via a single bit of classical communication. Despite 
the impossibility of this feat classically, the states used in the protocol are not entangled. 

Here we use the MID measure to study this purely quantum phenomenon. To do so, 
we evaluate the former on a generalization of the state used in [87], 

1 dm 

P=^dH £(I*X*I ® I*X*I)a ® (\KM\)b, (5-48) 

k=l t=l 

f d ~l m 

where the set of m orthonormal bases |{|&fc)} fc=1 | is mutually unbiased (MUB), i.e. 

V^t'^j | (bl\bj) | = 1/y/d. As in Reference [87], when d = 2 n and m = 2, the initial 
correlations in this state amount to n/2 bits, and by Alice's sending one bit (the bit t) 
to Bob, they end up with n + 1 correlated bits. The state being separable, it has no 
entanglement. Consequently, we cannot ascribe to entanglement the advantage exhibited 
by this protocol. 
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To calculate the MID measure of this state, we need the reduced states given by 



Choosing the local eigenvectors as the respective computational bases, we have that V(p) 
is simply the diagonal of p. Thus, 

MHp)} = ^{lry ,1,1/dr-- A/d,0,0, r - ,0} (5.50) 

whereby 




(m-l)d 2 d(d-l) 
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S{V{p)) = log m + ( 2 - — ) log d. (5.51) 



The spectrum of p is given by 

i\ = 

md 



x[p] = iiiiiC-^i'2^C-^ } (5 ' 52) 

md md(d-l) 



which leads to 
Finally, we have 



S(p) = logm + logd. (5.53) 



M( P ) =s(V( P ))-S(r)= (i_ I j log,/. 

m 



which for d = 2 n and m = 2 is the exactly equal to the gain attained by this scheme. 
Moreover, once Bob receives Alice's bit, the MID measure for their post-communication 
state drops to 0, the latter being diagonal in a local product basis. This suggests the 
possibility that the MID measure quantifies the non-classical (yet not entanglement-based) 
correlations in p which were initially locked. Moreover, we remark that for d = 2 n and 
m = 2, we have M.{p) = Ai*(p) — this follows directly from the result [87] that the mutual 
information of any classical distribution induced via local measurements on p is at most 
(logd)/2. 

A few remarks are in order. Equation (5.54) might suggest that a better locking effect 
may be possible for m > 2. However, explicit constructions to date using more than two 
MUBs have been unable to achieve superior locking [3 ], suggesting that the choice of 
construction for the MUBs plays an important role. In contrast, Equation (5.54) holds 
irrespective of the specific choice of MUBs. It is also known that if the bases above are 
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constructed using a large set of random unitaries chosen according to the Haar measure, 
then the classical mutual information in p between Alice and Bob can be brought down to 
a constant [137]. There is also numerical evidence (Appendix of Reference [87]) that the 
dimension of the systems may play a role in achieving better locking. Connections between 
locking and non-classical correlations have since been discovered in References [259, 49]. 

Finally, for completeness, we remark that Tr(p 2 ) = l/(md), and so by Theorem 5.1, 
the LNU distance for p is bounded by 



Thus, in contrast to the MID measure, the LNU distance once again reveals vanishing 
non-classicality with growing m or d. 

Acknowledgements for this chapter. We thank Carl Caves and Anil Shaji for numerous 
stimulating discussions, as well as an anonymous referee for raising certain points that led 
to improvements in the paper this chapter is based on. 
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Chapter 6 



Quantifying non-classicality with 
local unitary operations 

This chapter is based on [106]: 

S. Gharibian. Quantifying non-classicality with local unitary operations. Available at 
arXiv.org e-Print quant-ph/1202.1598vl, 2012. 

In this chapter, we propose a measure of non-classical correlations in bipartite quantum 
states based on local unitary operations. We prove the measure is non-zero if and only 
if the quantum discord is non-zero; this is achieved via a new characterization of zero 
discord states in terms of the state's correlation matrix. Moreover, our scheme can be 
extended to ensure the same relationship holds even with a generalized version of quantum 
discord in which higher-rank projective measurements are allowed. We next derive a closed 
form expression for our scheme in the cases of Werner states and (2 x iV)-dimensional 
systems. The latter reveals that for (2 x iV)-dimensional states, our measure reduces to 
the geometric discord [75]. A connection to the CHSH inequality is shown. We close with 
a characterization of all maximally non-classical, yet separable, (2 x iV)-dimensional states 
of rank at most two (with respect to our measure). 

6.1 Introduction and results 

One of the most intriguing aspects of quantum mechanics is quantum entanglement, which 
with the advent of quantum computing, was thrust into the limelight of quantum infor- 
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mation theoretic research [151]. We now know that correlations in quantum states due 
to entanglement are necessary in order for pure-state quantum computation to provide 
exponential speedups over its classical counterpart [ ]. With bipartite entanglement 
nowadays fairly well understood, however, attention has turned in recent years to a more 
general type of quantum correlation, dubbed simply non-classical correlations. Unlike 
entanglement, such correlations can be created via Local Operations and Classical Com- 
munication (LOCC), but nevertheless do not exist in the classical setting. Moreover, for 
certain mixed-state quantum computational feats, the amount of entanglement present can 
be small or vanishing, such as in the DQC1 model of computing [174] and the locking of 
classical correlations [87]. In these settings, it is rather non-classical correlations which are 
the conjectured resource enabling such feats (see, e.g. [77, 80, 185, 78]). In fact, almost all 
quantum states possess non-classical correlations [94]. 

As a result, much attention has recently been devoted to the quantification of non- 
classical correlations (e.g., [187, 119, 196, 118, 217, 185, 209, 188, 13, 216, 75, 231, 208], 
see [195] for a survey, and Section 1.6.2 for a brief exposition). Here, we say a bipartite 
state p acting on Hilbert space A <8> B is classically correlated in A if and only if there 
exists an orthonormal basis {\a}} for A such that 

p = ^2 / Pi\a i )(a i \® pi (6.1) 

for {p^ a probability distribution and pi density operators. To quantify "how far" p is from 
the form above, a number non-classicality measures, including perhaps the best-known such 
measure, the quantum discord [ 33, 138], ask the question of how drastically a bipartite 
quantum state is disturbed under local measurement on A. In this chapter, we take a 
different approach to the problem. We ask: Can disturbance of a bipartite system under 
local unitary operations be used to quantify non-classical correlations? 

It turns out that not only is the answer to this question yes, but that in fact for (2 x 
iV)-dimensional systems, the measure we construct coincides with the geometric quantum 
discord [ 5], a scheme based again on local measurements. Our measure is defined as 
follows. Given a bipartite quantum state p and unitary Ua acting on Hilbert spaces A <S> B 
and A with dimensions MN and M, respectively, define 



where the Frobenius norm || A || F = VTtjVA is used due to its simple calculation. Then, 
consider the set of unitary operators whose eigenvalues are precisely the M-th roots of 



D(p,U A ):=-= P -{U a ®Ib)p(U ] a ®I b ) 
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unity, i.e. whose vector of eigenvalues equals v for Vk = e 27Tkt ^ M for 1 < k < M. (The 
corresponding eigenvectors can be chosen arbitrarily.) We call such operators Root-of- 
Unity (RU) unitaries. They include, for example, the Pauli X, Y, and Z matrices (see 
Section 1.4.3). Then, letting RU(^4) denote the set of RU unitaries acting on A, we define 
our measure as: 

D(p):= mm D(p, U A ). (6.3) 
Note that < D(p) < 1 for all p acting on A <g> B. 



Our results: In this chapter, we show the following regarding D(p). 

1. Closed form expressions. Our first result is a closed-form expression for D(p) for 
(2 x iV)-dimensional systems (Theorem 6.3). This reveals that for (2 x iV)-dimensional p, 
D(p) coincides with the geometric discord of p. It also allows us to prove that, like the 
Fu distance [102, 107] (defined below in Previous Work), if D(p) > l/y/2, then p violates 
the Clauser-Horne-Shimony-Holt (CHSH) inequality [' ] (Corollary 6.5). We also derive 
a closed form expression for D(p) for Werner states, finding here that D(p) in fact equals 
the Fu distance of p (Theorem 6.6). 

2. States achieving D(p) = 1. We next show that only pure maximally entangled states 
p achieve the maximum value D(p) = 1, as expected (Corollary 6.8). 

3. D(p) is faithful. We show that D(p) is a faithful non-classicality measure, i.e. it 
achieves a value of zero if and only if p is classically correlated in A (Theorem 6.10). 
To prove this, we first derive a new characterization of states with zero quantum discord 
based on the correlation matrix of p. We then show that the states achieving D(p) = 
can be characterized in the same way. More generally, by extending our scheme to allow 
the eigenvalues of Ua to have multiplicity at most k, we prove a state is undisturbed under 
Ua if and only if there exists a projective measurement on A of rank at most k acting 
invariantly on the state (Theorem 6.11). This reproduces in a simple fashion a result of 
Reference [197] regarding entanglement quantification in the pure state setting. Based on 
this equivalence between disturbance under local unitary operations and local projective 
measurements, we propose a generalized definition of the quantum discord at the end of 
Section 6.6. 



4. Maximally non-classical, yet separable states. Finally, we characterize the set 
of maximally non-classical, yet separable, (2 x iV)-dimensional p of rank at most two, 
according to D(p) (and hence according to the geometric discord) (Lemmas 6.13 and 6.14). 
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Previous work: The Fu distance, denned as the maximization of Equation (6.2) over 
all Ua such that [Ua,Ttb(p)] = 0, was defined in Reference [102] and studied further in 
References [107] and [78] with regards to quantifying entanglement and non-classicality. 
Despite its strengths, such as a closed form solution for two-qubit systems and Werner 
states, and a connection to the CHSH inequality, the distance has weaknesses: It can at- 
tain its maximum value even on non-maximally entangled pure states [107], and is not a 
faithful non-classicality measure [ ]. Interestingly, our D(p) eliminates these weaknesses 
while preserving the former strengths. Subsequent to the conception of our scheme, the 
present author learned that there has also been an excellent line of work studying (the 
square of) Equation (6.3) in another setting — that of pure state entanglement. In Refer- 
ence [112], it was found that in (2 x N) and (3 x N) systems, D{\if})(if)\) coincides with 
the linear entropy of entanglement. Reference [197] then showed that for arbitrary bipar- 
tite pure states, D(|^)(^|) 2 is a faithful entanglement monotone, and derived upper and 
lower bounds in terms of the linear entropy of entanglement. Finally, alternative charac- 
terizations of zero discord states have been given in [203, 75, 76]. Maximally non-classical 
separable two-qubit states have been studied, for example, in [110, 114]. For example, the 
set of such states found [110] with respect to the relative entropy of quantumness matches 
our characterization for D(p)\ we remark, however, that our analysis for D(p) in this regard 
is more general than in [110] as it is based on a less restrictive ansatz. We remark that 
since the initial posting of the paper this chapter is based on, a related work by Streltsov 
et al. has appeared [229]. 



Discussion and open questions: Our results show that local unitary operations can 
indeed form the basis of a non-classicality measure with certain desirable properties. In 
particular, the scheme we consider is faithful, correctly identifies maximally non-classical 
states, and reveals interesting connections to a number of quantifiers of correlations, such 
as the Fu distance, the quantum discord, the geometric quantum discord, and the relative 
entropy of quantumness. As outlined above, the strengths of our scheme include a closed 
form for two-qubit states and Werner states, the former of which reveals a link between 
the paradigms of "disturbance under local unitary operations" and "disturbance under 
local measurements" by reducing to the geometric discord for two-qubit states. This link 
is further strengthened by the demonstration of connections to even generalized versions 
of the quantum discord. 

We leave open the following questions. For what other interesting classes of quantum 
states can a closed form expression for D(p) be found? Can a better intuitive understand- 
ing of the interplay between the notions of "disturbance under local measurements" and 
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"disturbance under local unitary operations" be obtained in higher dimensions? We give 
an analytical characterization of all maximally non-classical rank-two (2 x iV)-dimensional 
separable states — we conjecture that higher rank two-qubit states, for example, achieve 
strictly smaller values of D(p). Can this be proven rigorously and analytically? (We re- 
mark that a numerical proof for this conjecture was given in [114] for the geometric discord, 
for example.) What can the study of the generalized notion of quantum discord we define 
in Section 6.6, S v (p), tell us about non-classical correlations? 

Organization of this chapter: We begin in Section 6.2 with necessary definitions and 
useful lemmas. Closed forms for (2 x N) -dimensional systems are given in Section 6.3 
and for Werner states in Section 6.4. Section 6.5 characterizes the set of states achieving 
D(p) = 1. Section 6.6 shows that D(p) is faithful. In Section 6.7, we discuss maximally 
non-classical separable states. 

6.2 Preliminaries 

We begin by reviewing notation specific to this chapter, followed by relevant definitions 
and useful lemmas. Throughout this chapter, we use A and B to denote complex Euclidean 
spaces of dimensions M and N, respectively. We define pa '■= Tr B (p) and p# '■= Tr^(p). 
The anti-commutator of A and B is {A, B} = AB+BA. The notation diag(v) for complex 
vector v denotes a diagonal matrix with iih diagonal entry t>j, and span({vj}) denotes the 
span of the set of vectors {vj}. 

Moving to definitions, in this chapter we often decompose p G T>(A <8> B) in terms of a 
Hermitian basis for H(A <g> B) (sometimes known as the Fano form [91]): 

p= {I A ®I B + r A -a A ®I B + (6.4) 

M 2 -l N 2 -l 

i=i j=i 

Here, a A is a (M 2 — l)-component vector of traceless orthogonal Hermitian basis elements 
of satisfying Tr(afa A ) = 25^, r A G M. AI _1 is the Bloch vector for subsystem A with 
rf = ^-Ti(pAO'f), and T G ]R( M2_1 ) x ( Ar2_1 ) is the correlation matrix with entries Ty = 
^Tr(of (g) afp). For M = 2, r A satisfies < || r A || 2 < 1 with || r A || 2 = 1 if and only if 
Pa is pure. The definitions for subsystem B are analogous. 
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We now give a useful specific construction for the basis elements [139]. Define 
Wj£ _1 = {U pq , V pq , W r }, such that for 1 < p < q < M and 1 < r < M - 1, and {|z)}^ 
some orthonormal basis for ^4: 

tf* = |p>(g| + k><p| (6.5) 

V pq = -i\p)(q\+i\q){p\ (6.6) 
W r = J . 2 ^ \Y\k)(k\-r\r + l)(r + l\ 1 . (6.7) 



r(r + 1) 



v fc=i 



Note that when M = 2, this construction yields the Pauli matrices cr A = (X, Y, Z). 

Regarding D(p), defining pj := (Ua ® I B )p(U A <8> I B ), we often use the fact that Equa- 
tion (6.3) can be rewritten as: 



Dip) = min -v/Trfp 2 ) — Tr(pp f ). (Q.i 

u A eRV(A) v 



Finally, we show a simple but important lemma. 
Lemma 6.1. D(p) is invariant under local unitary operations. 

Proof. Let p' := (Va <8> Vb)p(Va <8> Vb)^ for unitaries Va, Vb- Then in Equation (6.8), 
Tr(// 2 ) = Tr(p 2 ), and Tr(p'p^) becomes 

Tr(p(\/j^V^ <8> Ib)p(V\U a Va ® I B )). (6.9) 

Observe, however, that V A U A V A is still an RU unitary, since we have simply changed basis. 
Hence, D(p', Ua) = D(p, V a UaVa), and since we are minimizing over all Ua £ RU(^4), the 
claim follows. □ 



6.3 (2 x iV)-dimensional states 

In this section, we study D(p) for p £ D(C 2 ® C ), obtaining among other results a closed 
from expression for D(p). To begin, note that any Ua £ RU(^4) must have the form 

U A := \c){c\ - \d)(d\ = 2\c)(c\ - I 2 , (6.10) 
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up to an irrelevant global phase which disappears upon application of Ua to our system, 
and for some orthonormal basis {\c), \d)} for C 2 . Then, D(p, Ua) can be rewritten as 

2v/Tr[p 2 (|c)(c| <g>/) -p(\c)(c\ ®I)p(\c)(c\ <g>/)]. (6.11) 

We begin with a simple upper bound on D(p). 
Theorem 6.2. For any p G £>(C 2 <g> C N ), one has 

D(p) < 2v/A min (Tr B (p 2 )). (6.12) 

Proof. Starting with Equation (6.11), by noting that Tr[p(|c)(c| £g> I)p(\c)(c\ (g) I)] > and 
using the fact that Tr(p(C J 4 <g> Ib)) = Tr^^C^), we have that D(p) is at most 

min 2v/Tr[Tr B (p 2 )|c)(c|] = 2v/A min (Tr B (p 2 )). □ (6.13) 

unit |c)eC 2 

Theorem 6.2 implies that for pure product G C 2 ®C Ar , D(\if))(ifj\) = 0, in agreement 
with the results in Reference [112]. By next exploiting the structure of p further, we obtain 
a closed form expression for D(p). 

Theorem 6.3. For any p G V(C 2 <g> C*), define G := r A (r A ) T + ^TT T , for T the 
correlation matrix of p. Then, D(p) equals 

^ v / Tr(G)-A max (G) = -±=y/\ 2 (G) + \ 3 (G). (6.14) 



Proof. Define P := \c)(c\. Then, beginning with Equation (6.11), by rewriting p using 
Equation (6.4) and applying the fact that the basis elements crj are traceless, we obtain 
that Tr (p 2 P ® / - pP ® IpP ® J) equals 

-Tr^-^ + Ag-^), (6.15) 



where 



4iV 



Ai := (E r ^) P ' A 2 :=^r^pj (6.16) 



^3 := ^(>:W®*f| (P®J) (6.17) 



A, 



^ (E ^ ® °f) (E ® °f) • (6-18) 
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Using the facts that (af) 2 = I, {erf, erf} = for i ^ j, Tr^e^) = 2%, and Tr(P) = 1, 
we thus have 

Tr (^i) = h A \\l Tr (^) = |E^ ( 6 - 19 ) 

Tr(A 2 ) = ^^(cl^lcXcl^lc) (6.20) 

Tr(A 4 ) = |e(E T ^) (c|^|c>(c|^| C >. (6.21) 

ij \ k / 

Now, (c|er^|c) can be thought of as the ith component of the Bloch vector v G I 3 of pure 
state |c) with || v || 2 = 1, implying 



Tr(y4 2 + A A ) = v 



T 



r A( r A )T + |_ TT T 



(6.22) 



Plugging these values into Equation (6.11), we conclude D(p) equals 



1,1111 4fJll rA ll' + ^E^- Tr ^ + ^4). (6.23) 



veR 3 \/N\ N 
l|v|| a =l V « 

The claim now follows since for any symmetric A e IR nxra , max un i t v gm™ v t Av = X max (A). 

□ 

The expression for D(p) in Theorem 6.3 matches that for the geometric discord [75, 242]. 
Specifically, defining the latter as S g (p) = min CT{E n p - a || F , where Q is the set of zero- 
discord states, we have for (2 x iV)-dimensional p that D(p) = S g (p). (Note: The original 
definition of Reference [75] was more precisely 5 g (p) = min CT6 n || p — a || F .) 

We now discuss consequences of Theorem 6.3, beginning with a lower bound which 
proves useful later. 

Corollary 6.4. For p e V(C 2 <g> C^), we have 

^(p)>^V / A 2 (TT^) + A 3 (TT^). (6.24) 
This holds with equality if v A — 0, i.e. p& = \- 
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Proof. The first claim follows from the fact that: 

A max (W A f + ^TTA < || r^ \\l + |A max (TT T ) . (6.25) 

The second claim follows by substitution into Equation (6.14). □ 

For example, for maximally entangled \ip) = ( 1 00) + \ll))/\/2, for which r B = and 
T = diag(l, — 1, 1), Corollary 6.4 yields D(\^)(^\) = 1, as desired. We also remark that 
Equation (6.14) can further be simplified for two-qubit states, since by Reference [148, 149], 
one can assume without loss of generality that T is diagonal. This relies on the facts that 
(1) applying local unitary V\ ® V 2 to p has the effect of mapping T \-t 0{FO\, r A i— > 0\r A , 
and r B i— > 0^c B for some orthogonal rotation matrices 0\ and 2 , and (2) D(p) is invariant 
under local unitaries by Lemma 6.1. 

Using Corollary 6.4, we next obtain a connection to the CHSH inequality for two-qubit 
p. Defining M(p) := \\{T T T) + A2 (T T T), it is known that p violates the CHSH inequality 
if and only if M(p) > 1 [150]. We thus have: 

Corollary 6.5. For p G £>(C 2 <g> C 2 ) ; if D(p) > l/y/2, then M(p) > 1. The converse does 
not hold. 

Proof. The first is immediate from Corollary 6.4 and the fact that TT T and T T T are 
cospectral (Theorem 1.3.20 of [ ]). The converse proceeds similarly to Theorem 7 of 
Reference [10' | - namely, let \ip) = a\00) + 6|11) for real a, b > and a 2 + b 2 = 1. 
Then, for density operator |^))(^|, we have r B = (0, 0, a 2 — b 2 ) and T = diag(2a&, —2ab, 1), 
implying M > 1 for a, b 7^ 0. In comparison, D(\i/>){ip\) = 2ab < l/y/2 when 

a ^ or a * yfi+&- D 

Interestingly, the exact same relationship as that in Corollary 6.5 was found between 
the Fu distance and the CHSH inequality in Reference [107]. 



6.4 Werner states 

We now derive a closed formula for D(p) for Werner states p G V(C d ® C d ) where d > 2, 
which are defined as [254] 
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for P s := (I + P)/2 and P a := (/ — P)/2 the projectors onto the symmetric and anti- 
symmetric subspaces, respectively, P := Ylt j=i\i) (j\ ® b')(*l the SWAP operator, and 
< p < 1. Werner states are invariant under U ®U for any unitary Z7, and are entangled 
if and only if p < 1/2. 

Theorem 6.6. Let p E V(C d £g> C d ) be a Werner state. Then 

DM = |2 ^ 2 -_ d - 11 (6.27) 

Proof. As done in Theorem 3 of Reference [107], we first rewrite Equation 6.8 using the 
facts that Tr(P) = d, Tr(P 2 ) = d 2 , and f5 := Tr(P(U A ® I)P(U A ® = Tr(?7 A )Tr(?7i) 
to obtain that for any Ua&U (A), 



D{p, U A ) = . (6.28) 

Since Tt(Ua) = for any U a £ RU(^4), we have /3 = and the claim follows. □ 

Again, we find that this coincides exactly with the expression for the Fu distance for 
Werner states [107]. Further, Theorem 6.6 implies that the quantum discord of Werner 
state p is zero if and only if p = (d + l)/2d. This matches the results of Chitambar [67], 
who develops the following closed formula for the discord S(p) of Werner states: 

1 — p p 
8(p) = log(d + 1) + (1 - p) log - — -+p\og 



2P log^-fl-AVog ^j - (6-29) 



d + 1 & ^ V d + lj & 2(d- 1) 

In Section 6.6, we show that this is no coincidence — it turns out that D(p) = if and 
only if the discord of p is zero for any p. 



6.5 Pure states of arbitrary dimension 

We now show that only pure maximally entangled states p achieve D(p) — 1. As mentioned 
in Section 6.1, this is in contrast to the Fu distance [102, 107], whose maximal value is 
attained even for certain non-maximally entangled We remark that Theorem 6.7 

below also follows from a more general non-trivial result that D(\if))(if)\) 2 is tightly upper 
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bounded by the linear entropy of entanglement of pure state \ip) [197]. However, our proof 
of Theorem 6.7 is much simpler and requires only elementary linear algebra. 

To begin, assume without loss of generality that M < N, and let \ip) G A ® B be a 
pure quantum state with Schmidt decomposition = J2k=i a k\ a k) ® \bk), i.e. ^2 k al = 1 
for a k G K and {la^)} and {|6fc)} the Schmidt bases for A and B, respectively. 

Theorem 6.7. Let \ip) G A®B with Schmidt decomposition as above. Then D(\ip)(ip\) = 1 
if and only if a k = -7= for all 1 < k < M (i.e. is maximally entangled) . 



Proof. We begin by rewriting Equation (6.8) as 



mm a 

U A £RU(A) \ 



^2a 2 k (a k \U A \a k ) 



k=l 



(6.30) 



If |^) is maximally entangled, then a k 
RU(^), Equation (6.30) yields 



1/VM for all 1 < k < M. Then, since U A G 



mm 

U A £KU(A) 



M 2 



|Tr(C/^ 



(6.31) 



For the converse, assume D(\if))(if)\) = 1. Then, by Equation (6.30), we must have that 
for all U A G RU(i), 

M 

J2 a l( a k\U A \a k ) = 0. (6.32) 

k=l 

Thus, choosing V a as diagonal in basis {|cife)}, Equation (6.32) says that w T 7rv = for all 
permutations 7r G Sm, where w k := oa and v k := e 2nki/M . This can only hold, however, if 
all entries of w are the same, i.e. a k = 1/vM for all 1 < k < M, as desired. □ 

Corollary 6.8. A quantum state p G T>(A®B) achieves D(p) = 1 if and only if p is pure 
and maximally entangled. 

Proof. Immediate from Theorem 6.7 and the Tr(p 2 ) in Equation (6.8). □ 
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6.6 Relationship to quantum discord 



We now show that for arbitrary p G T>(A (g> £>), D(p) is zero if and only if the quantum 
discord [203, 138] S(p) of p is zero. (The discord S(p) was defined in Section 1.6.2.) 

The main fact we leverage about the discord here is the following. 
Theorem 6.9 (Ollivier and Zurek [203]). For p G V(A ® B), 5(p) = if and only if 

p = U f ® lB P U f ® /B > ( 6 - 33 ) 

/or some complete set of rank 1 projectors {n^}. 

We now prove the main result of this section. The first part of the proof involves a new 
characterization of the set of zero discord quantum states p in terms of the basis elements 
af from the Fano form of p. Key to this characterization is the absence of non-diagonal af 
in the expansion of p. In the proofs below, we assume the basis elements af for A come 
from the set {/, U pq , V pq , W r } from Section 6.2 (analogously for B). 

Theorem 6.10. Let p G T>(A <8> B). Then 5(p) = if and only if there exists a local 
unitary V A such that 

Tr {[V A ® I B ) p (V A] ® J B ) (af ® af)\ = (6.34) 

for all af G {U pq , V pq } A and all af G {/, U pq , V pq , W r } B . The same characterization holds 
for t){p) 0. 

Proof. We prove the equivalent statement that 5(p) = if and only if there exists an 
orthonormal basis {\k)} for A such that, for basis elements af constructed with respect 
to {\k}}, we have Tr(p(af ® af)) = for all af G {U pq , V pq } (and similarly for D(p) = 0). 

Suppose S(p) = 0. Then by Theorem 6.9, there exists a complete set of rank 1 projectors 
{n^} such that Equation (6.33) holds. Let {\k)} be the basis onto which {n^} projects, 
and define $(C) := ^.H^CHf. By constructing the basis elements af in Equation (6.4) 
using we thus have 

P = ^[l A ®I B + I A ®r B -a B + (6.35) 

M 2 -l / N 2 -l 

i=i V j=i 
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Now, for all of G {W^}, we clearly have $(crf) = of. For of G {U pq ,V P q}, however, 
Q(of) = 0. Thus, in order for Equation (6.33) to hold, we must have rf = = for 
all basis elements of G {U pq , V pg }, which by definition means Ti{p{of <8> erf)) = for all 
erf G {Upq, V pq } A , as desired. To show that this implies D{p) = 0, construct U A G RU(^4) 
as diagonal in basis {\k)} and define $(C) := U A CU A . Then since in Equation (6.35), we 
have §{of) = of for any erf G {/, W r }, the claim follows. 

To show the converse, assume D(p,U A ) = for some U A G RU(.A). Then, construct 
the basis elements of with respect to a diagonalizing basis {\k)} for U A and define 3>(C) : = 
U A CU Ai . It follows that for any p and q, 

$(c/ M ) = e ^-e ? )| p )( g | + e -i(^)| ? ^| ; (oe) 

= -ie^-^lpXgl+ie-^-^lgXp]. (6.37) 

Consider now an arbitrary term (c u of+c v of) ®crf from the Fano form of p where of = U pq 
and o„ = V P q for some choice of p and q. Since Equations (6.36) and (6.37) imply that U A 
can only map U pq to V pq and vice versa, it follows that in order for D(p, U A ) = to hold, 
we must have Q(c u of + c v of) = c u of + c v of. This leads to the system of equations 

c u -ic v = e l ^~ e "\c u -ic v ) (6.38) 
c u + ic v = e- i{9p - e ^{c u + ic v ). (6.39) 

We conclude that if either c„ ^ or c„ ^ 0, it must be that 9 P = 6 q in order for D(p) = 
to hold. However, since all eigenvalues of U A are distinct by definition, this is impos- 
sible. Thus, Tr(p(of (g) of)) = for all of G {U pq ,V pq }, as desired. To see that this 
implies 8(p) = 0, simply now choose {nf} as the projection onto {|&)}. Then, defin- 
ing $(C) := Y2j n/Cn^ 1 and applying the same arguments from the forward direction to 
Equation (6.35), we conclude that p is invariant under {n^ 1 }. By Theorem 6.9, we have 
5(p) = 0, completing the proof. □ 

Theorem 6.10 shows that D(p) defined in Equation (6.3) is zero precisely for the set 
of states classically correlated in A. In other words, unlike the Fu distance [ 8], D(p) 
is indeed a faithful non-classicality measure. The proof of Theorem 6.10 does, however, 
have a curiosity — the key property the proof relies on is that all U A G RXJ{A) have 
non-degenerate spectra. Interestingly, this is the mixed-state analogue of the pure-state 
result of Reference [197], where it was shown that a non-degenerate spectrum suffices to 
conclude D(\ip){ip\) is a faithful entanglement monotone for pure states |?/>). Specifically, 
Reference [197] shows that if in Equation (6.3) we minimize over U A with eigenvalues of 
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multiplicity at most k (with at least one eigenvalue of multiplicity k), then D(\ip)(ip\) = 
if and only if \if>) has Schmidt rank at most k. Could there be an analogue of this more 
general result in the mixed-state setting of non-classicality? It turns out the answer is yes. 

Let v G N M such that YljLi v jj = M. Then, consider an arbitrary (i.e. not nec- 
essarily RU) unitary which has precisely Vj distinct eigenvalues with multiplicity j. 
For example, G RU{A) has v = (M, 0, . . . , 0) since it has M distinct eigenvalues of 
multiplicity 1. Similarly, if v = (0, 0, . . . , 1), then is just the identity (up to phase), 
and if v = (M — 4, 2, . . . , 0) then has M — 4 distinct eigenvalues of multiplicity 1, 
and two distinct eigenvalues with multiplicity 2 each. Now, corresponding to any is a 
complete projective measurement {rLj 1 } which consists precisely of Vj projectors of rank 
j. The correspondence is simple: Let A be an eigenvalue of with multiplicity j, i.e. 
the projector II a onto its eigenspace has rank j. Then Ha G {iLj 1 }^. It is easy to see that 
similarly, corresponding to any {iLj 1 } is a (assuming we are not concerned with the 
precise eigenvalues of U^, as is this case here). We can now state the following. 

Theorem 6.11. Let p G V(A ® B) and v G N M such that Ylf=i v j3 = M - Th en, there 
exists a complete projective measurement {IT^} such that 

p = J2ttf® lB pRf ® lB (6-40) 

j 

if and only if there exists a G U(A) with D(p, U^) = 0. 

Proof. The proof follows that of Theorem 6.10, so we outline the differences. Here, and 
{n^} v will be related through the correspondence outlined above, and the basis elements 
of are constructed with respect to a diagonalizing basis {\k)} for (which by definition 
also diagonalizes each Tlf G {irj 1 } ). For simplicity, we discuss the case of v = (M — 
2, 1, 0, ... , 0); all other cases proceed analogously. 

Going in the forward direction, suppose lif G {llj 1 } v projects onto S pq := span(|p), \q)). 
Then, in Equation (6.35), &(o~f) = af for of = U pq and af = V pq . In other words, now we 
can have rf ^ and ^ (however, note we still have r^L^ = and T m ^ it j = 0). Since 

has a degenerate eigenvalue on S pq , however, we have by Equations (6.36) and (6.37) 
that acts invariantly on af as well (since 9 P = 6 q ). The converse is similar; namely, 
suppose has a degenerate eigenvalue on S pq . Then the projector onto the corresponding 
two-dimensional eigenspace Uf G {n/} v is Uf = \p)(p\ + \q)(q\. It thus follows by the 
same argument as above that both and iLy act invariantly on U pq and V pq . □ 
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From this general theorem, we can re-derive as a simple corollary the pure state result 
of Reference [197] mentioned earlier, which we rephrase in our terminology as follows. 

Corollary 6.12. Let = J2l=i ^iyPt^^f) be the Schmidt decomposition of\ip) G A®B. 
Then, there exists £ U(A) with v k > 1 (i.e. has an eigenvalue of multiplicity k), 
vw>k = (all eigenvalues of have multiplicity at most k), and D{\ij))(il)\, U^) = if 
and only if k > r. 

Proof. Suppose k > r. Then, by defining {IT^}^ such that v k > 1 and v^>k = 0, one can 
choose a {11^}^ such that Equation (6.40) holds for p = \if)){if)\ (i.e. simply project onto 
span({ V'i 4 )}))- By Theorem 6.11, this implies there exists a with v\~ > 1 and vy^k — 
achieving D(\ip)(ip\, U^) = 0. Conversely, if k < r, then clearly no such {ilj 1 }^ such that 
Equation (6.40) holds exists. By Theorem 6.11, this implies that no XJ a with an eigenvalue 
of multiplicity at most k and D(\ip) (iP\,Ua) = exists, as desired. □ 

We close this section with two final comments. First, given Theorem 6.10, one might ask 
whether a stronger relationship between D(p) and S(p) holds. For example, could it be that 
D(p) > S(p) for all p? This simplest type of relationship is ruled out easily via Theorem 6.6 
and Equation (6.29), since for d = 2 and p = 2/3, D(p) = 1/9 > 8{p) « 0.01614, while for 
d = 50 and p = 2/3, D(p) w 0.00627 < 6(p) w 0.07111. 

Second, note that Theorem 6.11 reduces to Theorem 6.10 if we choose v = (M, 0, . . . , 0). 
This suggests defining a generalized quantum discord, denoted S y (p), which is analogous to 
S(p), except that now we use the class of measurements {nj 1 }^ in the definition of discord 
(see Equation 5.14). For example, 5(m,o,...,o)(p) — <Kp)- We hope the study of S v (p) would 
prove fruitful in its own right. 



6.7 Maximally non-classical, yet separable, (2 x N)- 
dimensional states 

In this section, we characterize the set of maximally non-classical, yet separable, (2 x N)- 
dimensional states of rank at most 2, as quantified by D(p). To do so, consider separable 
state 

n 

P = y^Pi|ai)(oj| <g) (6.41) 

i=l 
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where YliPi — 1> \ a i) £ C 2 , G C^. Via simple algebraic manipulation, one then finds 
that D(p, Ua) for any given Ua G U(A) is given by 



\ 



J]^ TO |(6 l |6 J )| 2 (|(« 1 |a J )| 2 -|(« i |t/i|« j )| 2 ). (6.42) 
i=i i=i 



We begin by proving a simple but useful upper bound on D(p) which depends solely on n. 

Lemma 6.13. Let p be a separable state as given by Equation (6.4-1)- Then D(p) < 
1 — maxj pi < 1 — - . 

Proof. Assume WLOG that m&XiPi = p\. Then 1/n < p\ < 1. Choose any Ua G U(A) 
such that |ai) is an eigenvector of Ua- Then any term in the double sum of Equation (6.42) 
in which \ai) appears vanishes. We can hence loosely upper bound the value of Equa- 



tion (6.42) by yjCE^tftPiPj) — 1 — Pi- Recalling that p\ > 1/n yields the desired 
bound. □ 

When n — 2, i.e. when p is rank at most two, observe from Lemma 6.13 that D(p) < 1/2, 
and this is attainable only when p\ — pi — 1/2. We now show that this bound can indeed 
be saturated, and characterize all states with n = 2 that do so. 

Lemma 6.14. Let p be a separable state as in Equation (6. 41) with p\ = P2 = 1/2. Then 
D(p) = 1/2 if and only if \( ai \a 2 )\ = l/V? and (&i|&a) = 0. 

Proof. Since by Lemma 6.1, D(p) is invariant under local unitaries, we can assume without 
loss of generality that |ai) = |0), |&i) = |0), \a 2 ) = cos ||0) +sin ||1) and 1 62} = J^iLo 1 a i\^) 
for G [0,7r] and «j G M with £V af = 1, i.e. we can rotate the local states so as to 
eliminate relative phases. Further, since Ua G RU(^4) in Equation (6.42), we can write 
U A = 2\u)(u\ -I for some \u) = cos||0) +e^sin||l), where 9, <f> G [0,2tt). Via the latter, 
we can rewrite Equation (6.42) as: 



\ 



2 

^ (6j|&j) 2 ((aj|aj) 2 - \(ai\aj) - 2(a i \u)(u\a j )\ 2 ). (6.43) 



Letting A denote the expression under the square root above, we have by substituting in 
our expressions for |oi), \a>2), \bi), I&2), and \u) and algebraic manipulation that 

A = a 2 [2 cos @ sin 2 9 - sin f} sin(20) cos 0] + 

1 + sin 2 9 - (cos cos 9 + sin sin 9 cos 0) 2 . (6.44) 
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Our goal is to maximize A with respect to «o and (3 (which define p), and then minimize 
with respect to 9 and (which define Ua)- Observe now that choosing = 9 = reduces 
Equation (6.44) to A = 1 — cos 2 /3. Hence, unless (3 = n/2 (i.e. |(ai|a 2 )| = I/a/2), we 
can always achieve D(p) < 1/2. Thus, set = ir/2. Consider next = 0, and leave 9 
unassigned. Then, Equation (6.44) reduces to A = 1 — «Qsin(2^), from which it is clear 
that unless oq = (i.e. (61I&2) = 0), we can always achieve D(p) < 1/2. Plugging these 
values of a and (3 into Equation (6.44), we have A = 1 + sin 2 9 sin 2 0, from which the claim 
follows. □ 

For two-qubit p, we thus have that with respect to D(p) and the geometric discord, the 
maximally non-classical two qubit states of rank at most two are, up to local unitaries, 

i|0)<0|®|0><0| + i|+>(+|(8i|l>(l|, 

where |+) = (|0) + |1))/ y/2. As mentioned earlier, this matches known results with respect 
to the relative entropy of quantumness [110]. However, the latter analysis is not as general 
as it begins by with the assumption that (61J62) = 0, whereas we allow arbitrary j^)- 
It would be interesting to know whether this analysis can be extended to arbitrary rank 
two-qubit states. 

Acknowledgements for this chapter. We thank Gerardo Adesso, Dagmar Brufi, Davide 
Girolami and Marco Piani for helpful discussions. 
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Chapter 7 



All non-classical correlations can be 
activated into distillable 
entanglement 

This chapter is based on [208]: 

M. Piani, S. Gharibian, G. Adesso, J. Calsamiglia, P. Horodecki and A. Winter. All non- 
classical correlations can be activated into distillable entanglement. Physical Review Let- 
ters, 106:220403, 2011, DOI: 10. 1103/PhysRevLett. 106. 220403, © 2011 American Physical 
Society, prl.aps.org. 

In this chapter, we introduce a protocol through which general non-classical multipartite 
correlations can be mapped or "activated" into bipartite entanglement. In particular, we 
provide an operational interpretation for the measure of non-classicality known as the 
relative entropy of quantumness, showing that it quantifies the minimum distillable entan- 
glement generated between the initial system and ancillae in our protocol. Moreover, we 
show the following surprising fact: That mixed entangled states can be arbitrarily more 
non-classical than separable and pure entangled states. 

7.1 Introduction and results 

The study of quantum correlations has traditionally focused on entanglement [11 ]. In 
particular, it is generally believed that entanglement is a necessary resource for quantum 
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computers to outperform their classical counterparts. Indeed, it has been shown that for 
the setting of pure-state computation, the amount of entanglement present must grow with 
the system size for an exponential speed-up to occur [11 ]. In the context of mixed-state 
quantum information processing, however, there are surprising quantum computational 
and communication feats which are seemingly impossible to achieve with a classical com- 
puter, and yet can be attained with a quantum computer using little or no entanglement. 
Examples include the DQC1 model of computing [174] and the locking of classical correla- 
tions [87]; see Section 1.6.2 for a brief exposition. In the case of locking, for example, the 
task involved is impossible classically, and yet the quantum states used are separable. This 
raises the question: What is the fundamental resource enabling such feats? 

One plausible explanation is the presence in (generic [9 J]) quantum states of non- 
classical correlations beyond entanglement. Indeed, as outlined in Section 1.6.2, much 
attention has recently been devoted to understanding and quantifying such correlations for 
this reason [203, 138, 187, 119, 196, 118, 217, 185, 54, 209, 188, 94, 13, 216]. In particular, 
the separable quantum states of the systems involved in DQC1 and the locking protocol 
have been shown to possess non-zero amounts of such correlations (see e.g. [80, 78]), as 
measured by the quantum discord ['. )3, 138]. The latter strives to capture non-classical 
correlations beyond entanglement and has recently received operational interpretations in 
terms of the quantum state merging protocol [62, 190], but is unfortunately not a faithful 
measure (here, a faithful measure achieves a non-zero value of zero if and only if a state is 
"non-classical" ) . A more accurate quantification of non-classical correlations is provided by 
the so-called relative entropy of quantumness (REQ) [54, 187, 118, 217, 196], defined as the 
minimum distance, in terms of relative entropy, between a multipartite quantum state and 
the closest strictly classically correlated state (see Definition 7.1). Such a measure is faithful 
[118], symmetric under permutation of the subsystems, and enables a unified approach to 
the quantification of classical, separable and entangled correlations [196]. However, to date 
it still lacks an operational interpretation. 

More generally, in this chapter, we ask the following question: Is there a protocol by 
which general non- classical correlations produce a physically relevant effect that distin- 
guishes them from purely classical ones? 

It turns out that the answer to the above question is not only yes, but that among other 
results, the protocol we derive lends the desired operational interpretation to the REQ. 

Our results: In order to summarize our results, recall first from Equation (1.128) the 
definition of a strictly classically correlated or classical state in the bipartite setting. For 
completeness, we state the generalization of this definition to the multipartite setting 
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below [209]. 

Definition 7.1 (Strictly classically correlated quantum state). Let p G T>({€, d )® n ), i.e. p 
acts on n d- dimensional systems. Let B% := {\bi(j))}jZl denote some orthonormal basis 
for C d for the ith system, and let B denote the orthonormal basis 

{\b(k)):=\b 1 (k 1 ))\b 2 (k 2 ))...\b n (k n ))} (7.1) 

for the entire space (C d )® n formed by taking tensor products of all elements in bases {Bi}™ =1 . 
Here, k := kik 2 ■ ■ ■ k n is a number written in base d. We henceforth use the notation B 
to refer to such a local product basis. Then, an n-qudit state p is strictly classically 
correlated, or classical, if there exists a local product basis B with respect to which p is 
diagonal. 

Recall that classical states correspond to the embedding of a multipartite classical proba- 
bility distribution into the quantum formalism, and that states not of the form above are 
called non-classical. We now summarize our results as follows. 

1. An "activation" protocol for non-classical correlations. Our first result is a 
protocol through which non-classical correlations are mapped into entanglement. Roughly, 
given an input state p G X>((C d )® n ), the protocol first introduces an ancilla state 1 - ■ ■ 0) G 
(C d )® n . We then show that p is non-classically correlated if and only if applying local 
CNOT gates with system i of p as control and system i of the ancilla as target always 
creates (distillable) entanglement across the system-ancilla split, even if one adversarially 
applies local changes of basis to p before applying the CNOT gates (Theorem 7.3). 

We thus not only have a physical effect arising from non-classical correlations, as desired, 
but also an entire framework for designing non-classicality measures. Specifically, for each 
choice of entanglement measure one applies across the system-ancilla gap after the protocol 
is run, we have the potential for a new non-classicality measure for system p. 

2. Connections to non-classicality measures. As mentioned above, by applying 
our favorite entanglement measure across the system-ancilla cut after our protocol is run, 
we have the potential for discovering new non-classicality measures for the initial system 
p. In this vein, we first find that applying the entanglement measure distillable entan- 
glement [210], we obtain a non-classicality measure we call the minimum distillable en- 
tanglement potential, which turns out to equal the REQ (Corollary 7.5). We thus have 
an operational interpretation for the REQ. We also consider the negativity [241] as an 
entanglement measure, obtaining various results of interest here (Section 7.4.2). 
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3. Mixedness versus entanglement in non-classicality. Our final result studies the 
minimum distillable entanglement potential (or equivalently, REQ). As might be expected, 
we first find that according to this non-classicality quantifier, pure entangled states are 
strictly "more non-classical" than separable states. However, perhaps surprisingly, we 
next show that in the asymptotic setting, (1) separable states can be as non-classical as 
pure entangled states (Theorem 7.10), and (2) mixed entangled states can be much more 
non-classical than pure entangled states (Theorem 7.11)! This suggests that non-classical 
correlations arise not just from the superposition principle of quantum mechanics, as is 
the case with (pure state) entanglement, but also due to the non-commutative nature of 
quantum physics. Our proofs here use ideas similar to known concentration of measure 
arguments [136, 137]. 

Previous work. We refer the reader to Section 1.6.2 for a brief introduction to non- 
classical correlations. With regards to this chapter, we remark that after completion of the 
paper this chapter is based on, we became aware of related results by Streltsov, Kamper- 
mann and Brufi [231]. They show that the quantumness of correlations (as measured, for 
example, by the quantum discord) is also related to the minimum entanglement generated 
between system and apparatus in a partial measurement process. In light of those results, 
our findings can be understood also as dealing with the interplay between system-apparatus 
entanglement and non-classicality of correlations when realizing local measurements. 

Discussion and open questions. The study of general non-classical correlations is cur- 
rently a burgeoning area, but in many ways such correlations are still not well-understood. 
Our activation protocol lends new insight into the nature of these correlations by furnishing 
them with a new operational meaning in terms of resources for entanglement generation. 
One natural and interesting open question is whether the ideas behind the protocol could 
lead to novel applications in quantum computation and information. 

Furthermore, our novel framework for non-classicality measures reduces the problem of 
non-classicality quantification to the more familiar setting of entanglement quantification, 
for which a multitude of tools for analysis are already known (see e.g. [11 ]). An open 
question here is what further known non-classicality quantification schemes can be obtained 
as arising through our framework? 

Finally, that mixing can actually help surpass the quantumness of pure-state entan- 
glement, and that the latter can be asymptotically matched by fully separable states is, 
in our opinion, quite a surprising result. It would be good to better understand the non- 
commutative nature of states in a quantum mixture, both from the perspective of non- 
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classical correlations, as well as with regard to computational and information theoretic 
feats. 



Organization of chapter. In Section 7.3, we describe our activation protocol, and show 
how it yields a connection between entanglement and non-classical correlations. Section 7.4 
then exploits this connection further by introducing an entire family of non-classicality 
quantifiers, demonstrating along the way an operational interpretation of the REQ. In 
Section 7.5, we show two surprising results in systems of large local dimensions: That 
mixed separable states can be asymptotically as non-classical as pure maximally entangled 
states, and that mixed entangled states can be asymptotically twice as non-classical as 
pure maximally entangled states. 



7.2 Preliminaries 

We now state notation and a lemma specific to this chapter. Regarding notation, given a 
local product basis B = {\b(k))} and multipartite quantum state p, we define 

p B :=J2Hk))m\p\b(k)}(b(k)l (7.2) 

k 

and 

p B hl :={b(k)\p\b(l)). (7.3) 

We next state Levy's Lemma, which is useful in Section 7.5. For this, we first define the 
Lipschitz constant of a function /. Given function J:l4F for metric spaces (X, dx) 
and (Y,dy), where dx and dy are metrics on the sets X and Y, respectively, we say that 
/ has Lipschitz constant m > if the distance between any two input points in X does 
not increase by more than m after going through /. In other words, for all X\,x 2 € X, 

d Y (f(xi),f(x 2 )) < m ■ d x (x 1 ,x 2 ). (7.4) 

Then, for § fc the fc-sphere and E(/) the expected value of function /, we can state the 
following useful Lemma, known as Levy's Lemma. 

Lemma 7.2 (Levy's Lemma, see e.g. [136]). Let f : § fc i— y R be a function whose Lipschitz 
constant with respect to the Euclidean norm is m > 0. Let x G § fc be chosen uniformly at 
random. Then, for some constant c > 0, 

Pr (f(x) - E(f) ^±a)<2 exp f -< k + 1 )°^ \ . (7 . 5) 
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Figure 7.1: (Color online) Scheme of the activation protocol for n — 3. 



7.3 The activation protocol 

We now describe our protocol for the activation of non-classical correlations, which maps 
relatively "not-well-understood" non-classical correlations into "more familiar" bipartite 
entanglement, allowing one to employ tools from entanglement theory [151] to study general 
non-classical correlations. The protocol can be thought of as a game between an adversary 
and n players, where the n players together aim to generate an entangled state between a 
system A they control and an ancillary system A', and the adversary's goal is to thwart 
their efforts by locally rotating each subsystem of A before system and ancilla undergo a 
pre-defined interaction. 

More precisely, the protocol proceeds as follows (see Figure 7.1). We consider n players 
Vi, each controlling a system-ancilla pair of qudits (Ai,A^). We indicate by A the joint 
register A±, . . . , A n , henceforth called the "system" , and by A' the joint register A[, . . . , A' n , 
henceforth called the "ancilla" . The initial state of the total 2n qudits is a tensor product 
Paa 1 = Pa® |0) (0|®r« For a given an adversary is first allowed to apply a local unitary Ui 
of his choice to each A{. With the adversary's turn complete, each player Vi now lets their 
subsystem Ai (control qudit) interact with the corresponding ancillary party A\ (target 
qudit) via a CNOT gate, whose action on the computational basis states \ j)\f) of C d £g>C d 
is defined as \j)\f) \j)\f © j), with © denoting addition modulo d. The final state of 
system plus ancilla is 

^ = V(pA®|0)(0|y)Vt, (7.6) 
where V = CNOT AA ,{U A ® I A ,), U A = ®? =1 Ui and CNOT AA , = ® 1 * =l CNOT A . Al .. We 
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ask: At the end of the protocol, have the n players succeeded in generating bipartite 
entanglement across the split A : A', and, if so, how much entanglement was created? It 
is natural to expect that the answer will depend on the initial state p& of the n-qudit 
system. For simplicity of notation, in the remainder of this chapter, we shall take p and 
pf to denote the states pa and p AA ,, respectively. 

Although we cast the activation protocol as a game, from a more physical perspective 
our aim is to understand precisely how the nature and amount of correlations between the 
parts Ai of the system A affects the entanglement that can be created with an ancilla A 1 
via the paradigmatic entangling operation — the CNOT; we are considering here the worst 
case scenario with respect to the choice of the control bases. We then find the following. 

Theorem 7.3. A state p of an n-qudit system is classical if and only if there exists an 
adversarial choice of local unitaries U A such that the state p* output by the activation 
protocol is separable across the system- ancilla (i.e. A: A') split. 

Proof. The "if" part is trivial, as given a strictly classically correlated state, one can 
choose Ua to rotate the diagonalizing local product basis for p = J2iPi\K^))(^(^)\ m t° the 
computational basis, so that applying the CNOTs in our protocol straightforwardly yields 
the separable state 

P f = ^2Pi\i)(i\ a® l^l A>- ( 7 - 7 ) 

i 

As for the "only if" part, consider the separable decomposition 

i 

which exists by hypothesis for some choice of Ua- Since the transformation V in Equa- 
tion (7.6) is unitary, we must be able to write p = ^ qi\vi)(vi\ A for some ensemble (not 
necessarily a spectral decomposition) {g i5 \vi)} such that 

V\vi) A \0) A , = \A) A \<i>i) A , (7.9) 

Letting {\j)} denote the computational basis, we now expand \vi) in the basis 
such that 

\vi)A = Y,<*atf A \j) A1 (7.io) 

3 

from which it follows that 

^>A|0>A' = E a ^>AbV (7-11) 

3 
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Combining this with Equation (7.9), we conclude that for all i, there must exist a j such 
that otij = 1. Denote this value of j as ji, and note hence that 

\vi) A \0) A , = (| Ji ) A | Ji ) A/ ) = UlltijO)^. (7.12) 

We can now write 

P = ^2<li\vi)(vi\ A = ^qiU A \ji}(ji\ A U Aj (7.13) 

i i 

which is a spectral decomposition for p with respect to the computational basis up to local 
unitary U A , as desired. □ 

In other words, the system always becomes (for any choice of Ua) entangled with the 
ancilla as a result of the activation protocol, if and only if the input state of the system is 
non-classically correlated. This establishes a qualitative equivalence between multipartite 
non-classical correlations among components of a quantum system, and bipartite entan- 
glement between the system and an ancilla. 



7.4 Quantifying non-classicality 

We now exploit the spirit of Theorem 7.3 further to quantify, rather than simply detect, 
the presence of non-classical correlations in a quantum state. To do so, our approach is to 
apply entanglement measures across the A : A' split to study the amount of entanglement 
generated whenever A is initially in a non-classically correlated state. It is worth remarking 
here that this framework is general enough to possibly uncover a full zoology of non- 
classicality measures, as each choice of a different entanglement monotone [210] we adopt 
(at the output) has the potential to lead to a unique non-classicality measure (for the input 
state), the association being provided exactly by the activation protocol. 

More precisely, let E denote some entanglement measure of choice and the system- 
ancilla state at the end of the protocol as in Equation (7.6), and define by 

Q E (p):= mm E A:A ,(p f ) (7.14) 

Ua 

the minimum entanglement generated across the A : A' split over all choices of adversarial 
local unitaries U A . We call Qe{p) the minimum entanglement potential of p with respect 
to E. As a consequence of Theorem 7.3, Qe is a measure of non-classical correlations 
for arbitrary multipartite qudit states p, induced by the entanglement monotone E. In 
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fact, the condition Qe(p) = perfectly characterizes the set of classically correlated states 
p if E is a faithful entanglement measure (i.e. if E vanishes only for separable states). 
However, even certain non-faithful entanglement measures can be plugged in to obtain a 
faithful measure of non-classical correlations. The reason is that the output state p* has 
the so-called maximally correlated form [212] between A and A'; namely, 

/=E^i*>< z u®i fc >< i U' ( 7 - 15 ) 

kl 

with pjf, = (b(k)\p\b(l)), \b(k)) = U\\k) and \k) = \h)\k 2 ) ■ ■ ■ \k n ) . We now exploit this 
observation in the next section. 



7.4.1 Minimum distillable entanglement potential 

Let us consider the non-faithful but physically motivated distillable entanglement E^> [210 
as a bipartite entanglement monotone (recall Ejy is non-faithful as it vanishes on so-called 
bound entangled states). Note that the precise definition of Ed is not required here; rather 
we utilize results of [ ] linking -Ed(p) to the relative entropy of entanglement. Specifically, 
we have the following. 

Theorem 7.4. The minimum distillable entanglement potential Qe d (p) equals 

Q ED (p) = mm (s(p B )-S(p)), (7.16) 
where the minimization is over the choice of local product bases B. 

Proof. The claim follows by observing that for any choice of B, the A : A' distillable 
entanglement of pf is equal to 

Eoipf) = S(Tr A ,(pf)) - S(pf) = S(p B ) - S(p), (7.17) 

where S(a) = — Tr(crlogcr) is the von Neumann entropy of a state a. In the first equality 
we used the results of [140] about distillable entanglement for maximally correlated states 
- for which it happens to coincide with the relative entropy of entanglement [239, 238]. 
The second equality is justified by the fact that p B is the state resulting from local projec- 
tive measurements in the local bases B on p and is unitarily equivalent to Tr^/(p^) (seen 
by considering Equation (7.15)), while p* is obtained from p via the activation protocol 
isometry, Equation (7.6). □ 
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This yields the following nice corollary regarding the REQ, which is defined as (see also 
Section 1.6.2) 

Q{p) = min S(p\\a), (7.18) 

classical a 

for S'(pIIct) = Tr(plogp — plogcx) the relative entropy and where the minimization is over 
all strictly classically correlated states a. 

Corollary 7.5. The REQ of p equals its minimum distillable entanglement potential, i.e. 

Q(p) = QeM- (7-19) 



Proof. The claim follows immediately from the result that, as proven for example in 
Theorem 2 of [196], the REQ can alternatively be expressed as the expression in Equa- 
tion (7.16). □ 

This finding immediately provides a clear-cut operational interpretation for the REQ, 
which therefore emerges as a natural, mathematically sound and physically motivated 
measure of non-classical correlations in quantum states of arbitrary-dimensional composite 
systems. The degree of non-classical correlations as quantified by the REQ, a measure 
whose original definition was purely geometric [196], is quantitatively reinterpreted as the 
resource power of such correlations for the task of generating distillable entanglement with 
an ancilla in the worst case scenario. Incidentally, since the REQ is faithful [118], this can 
be considered an alternate proof of Theorem 7.3. 

Before closing this section, we prove a strict upper bound on the non-classicality of 
separable bipartite quantum states with respect to Qe u - 

Theorem 7.6. Consider bipartite separable state pab — YliPi\ a i){ a i\ ® IAXAI; f or {Pi} a 
probability distribution and {|ai)}, Q C d . Then, Q(pab) < logrf. 

Proof. We have 

Q{p AB ) = mm(s{ P B AB )-S{ PAB )) (7.20) 

< mm (s{p B AB ) - S{p A )) (7.21) 
= mm (S{ P B /) + (b A {i)\p A \b A {i))S{af B ) - S{p A )) (7.22) 

i 

< rmn J2p?S(o-? b ), (7-23) 
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where 



" := t MOWMO) l»»(»»<»«WI- 

where {pf } are the eigenvalues of p A , the first inequality follows since for any separable 
state, S(pab) > max.{S(pA), S(p.b)} [201], and the second inequality by choosing Ba as an 
eigenbasis of pa (yielding S(p A A ) = S(pa))- 

Suppose now, for sake of contradiction, that this upper bound is equal to log d. Then, 

13 

it must be the case that a i B is maximally mixed for all i, implying that pb is also maxi- 
mally mixed. Reversing the role of A and B, an analogous argument yields that pa must 
be maximally mixed as well. This means that the basis chosen in the second inequal- 
ity is arbitrary, and we find that for the last line to be equal to logrf, it must be that 
(bA(i)bB(j)\pAB\bA(i)bB(j)) = 1/d 2 for all bA,bs and all Thus, pab = I/d 2 . However, 
this state is classical, and hence achieves Q(pab) = 0, yielding the desired contradic- 
tion. □ 



7.4.2 Negativity of quantumness 

The next entanglement monotone we consider in our scheme is the Negativity [241]. The 
latter is defined for a bipartite state pab as N(pab) '■= (|| Pab || tr — -0/2> f° r Pab the 
partially transposed state. Plugging J\f into our framework, we obtain a non-classicality 
measure we call the negativity of quantumness, Qm{p)- 

Theorem 7.7. For the negativity of quantumness, Qn, we have that 

Q M {p) = l -mmY J \p- j \- (7-25) 

Proof. Thanks to the maximally correlated form of the output of our protocol, by directly 
applying the definition of the partial transpose to pf , we can calculate the eigenvalues of 
(pf ) TA as p b u for all i, and ±|p^| for i > j. Thus, 

\\(pf) T A\\ _i y^.\ P B .\ 

M( p f) = 11 {P ) = (7.26) 

□ 
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We remark that since, by definition, a non-classical state must have some non-vanishing 
off-diagonal terms p® in any local product basis B, we thus obtain yet another proof of 
Theorem 7.3, i.e. that p* is entangled for any local rotation U a if and only if p is not 
classical. 

Next, for the special case of pure bipartite states l^), we find that has a particularly 
simple form, in that it reduces to the negativity of 

Corollary 7.8. For rank one bipartite states l^)^, 

Qj*{\il>)(1>\)=tf(\1>)(4>\)- (7-27) 

Proof. Note first that for \ip) = Yli a i\ a i)\bi) the Schmidt decomposition of one has 
J\f(\i/})(i/}\) = Y^tej We now show that Qn(\i/;) matches this expression. 

For any local product basis £>, one can write \ip) = Yltj=o a ij\ b i(^))\ b 2(j)) ■ Then, letting 
p = and beginning from Equation (7.15), straightforwardly applying the definitions 

of the trace norm and partial transpose yields in Equation (7.26) that 




(7.28) 



Note here that the coefficients are specific to the choice of basis B — thus, our goal is 
to choose B so as to minimize Ylij \ a ij\- We claim that this minimizing basis is in fact just 
the tensor product of the local Schmidt bases for 

To see this, we use the vec mapping [246], which can be defined such that vec(|o)(6|) = 
\a)\b) (and analogously, vec _1 (|a)|6)) = |a)(fe|), and the l\ norm, defined as ||C||j : = 
Y^ij \Cij\. Define now 

5>#i(z)>i& 2 0j> = x^imoxwi- ( 7 - 29 ) 

ij=0 J ij=0 

Then, we have 

Ew = n cf iiii^ii C7 iitr = E^ ( 7 - 3 °) 

ij i 
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for {o"j} the singular values of C. Here, the claim || C ||j > || C || tr follows since 

IICIL = max Tr((2M-I)C) (7.31) 

< max |(vec(2M-/),vec(C))| (7.32) 

< max ||vec(2M- /) II II vec(C) IL (7.33) 

< ||cj| Jlf (7.34) 

where the second inequality follows from the Holder inequality, the third inequality from 
the fact that || vec(2M — J) || is at most the spectral norm of 2M — I, and where || v || : = 
maxj \vi\ and || w || x := J2i \ v i\- 

The final step is to observe that the <j\ are in fact the Schmidt coefficients of since 
if C = &i\a>i) (h\ is the singular value decomposition of C, then vec(C) = J2i (T i\ a i)\^i) 
is a Schmidt decomposition for \ip). Since {\a,i) g) \bj)} is a valid local product basis b in 
which to expand \if)), by combining Equations 7.28 and 7.30 the claim follows. □ 

We finally extend our analysis for pure states to the setting of pseudo-pure states 

p(^, P ):=^^-I + p\^M (7.35) 

where < p < 1. 

Corollary 7.9. For pseudo-pure state p(ip,p), we have 

Q N {p)=pU{\^)^\). (7.36) 



Proof. Follows immediately from Equation (7.25) and Corollary 7.8 by recalling that pfj = 
(b(i)\p\b(j)). * ~ ' ^ □ 

Hence, as already observed in, for example, Reference [118], p(ip,p) is non-classical as long 
as p > and ip is entangled. 



7.5 Non-classicality, mixedness, and entanglement 

Equipped with a faithful and operational measure of non-classical correlations, Qe d , which 
we henceforth refer to as Q, we now investigate the interplay between non-classicality, 
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entanglement and mixedness of general states pa- For the sake of simplicity, from now on 
we restrict to the bipartite case A± = A, A 2 = B. We begin by setting the stage with a 
few simple but general observations following from the definition of Q. 

For pure states pab = IV'XV'I, Q(pab) reduces to the von Neumann entropy of entan- 
glement S(pa) = S(pb) [54], and is thus at most equal to logd On the other hand, for 
arbitrary mixed pab-, we have that Q(pab) is at most 21og<i, since from Equation (7.18) 
one has Q(pab) < S(p A b\\pa ® Pb) = S(p A ) + S(p B ) - S(pab) = I(pab), where I denotes 
the mutual information, a measure of total correlations. From this and the results of [201], 
one realizes that for a separable state a bound Q(p B 2 B ) < logc? holds. Now recall from 
Theorem 7.6 that this inequality is always sharp for separable states, i.e. the bound logrf 
cannot be exactly saturated for separable non-classical states, while it is instead trivially 
reached by pure maximally entangled states \ip) = d~ 1 ^ 2 ^„-Z 

Surprisingly, what we now show is that as al — > oo, this upper bound is in fact asymp- 
totically attained by separable states. More precisely, we show that there exist separable 
states such that Q(p A ^)/ log d —> 1 with growing d. Even more intriguingly, we can show 
that the upper bound on general mixed bipartite states pab is also asymptotically tight; 
specifically, there exist families of mixed states for which as d — > oo, Q(pab)/ 'logd —> 2. 

More formally, we prove the following two results, where m := [(logc/) 4 ]. 
Theorem 7.10. Define the random separable state: 

aAB = ^L £ ML® { U Mi\U]) B , (7-37) 

i=l,...,d 
j = l,...,m 

for unitaries Uj drawn independently from the Haar measure. Then, with high probability, 
Q(o-ab) > logd- (log log d). 

Theorem 7.11. For C a system of dimension m, let pab = I^cI^X^Ubc w here G 
C d <S> <C d ® C m is uniformly distributed (with probability induced by the Haar measure). 
Then, with high probability, Q(pab) > 21og<i — O (log log d). 

What these results tell us is that, first, there are separable states that asymptotically 
(in d) are as non-classical as the most non-classical pure state (which is the maximally 
entangled state); second, mixed entangled states can be much more non-classical (namely, 
twice as much) than pure entangled states. We remark that therefore both entanglement 
and mixedness are required to "break the barrier" of log d. This goes against the intuition 
that entanglement by itself is the strongest form of non-classicality: We demonstrate that 
mixedness also plays a prominent role and can make correlations maximally non-classical. 
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One possible explanation for these findings may be the following: Traditionally, the 
study of quantum correlations has focused on quantum entanglement, which arises from 
the superposition principle of quantum mechanics. Yet, there is another "non-classical" 
feature of quantum mechanics to be reckoned with; namely, that quantum systems can 
be in probabilistic mixtures of non-commuting states. What Theorem 7.10 thus quantifies 
is the extent to which non-commutativity alone can give rise to non-classical correlations. 
When non-commutativity is then combined with the superposition principle, Theorem 7.11 
tells us that the non-classical correlations generated are stronger than possible with either 
principle alone. 

We close this section with the proofs of Theorems 7.10 and 7.11. 

Proof of Theorem 7.10. The claim will follow by showing that in Equation (7.16), S(<j A b) < 
logd + logm, whereas for d sufficiently large and with high probability, S{a AB ) > 21og<i — 
const, for all B. The first of these bounds is easy to prove — it follows by observing that 
the rank of gab is at most dm. 

As for the second bound, note first that 

S(a B AB )=S(a B )+S(a B )-I(a B AB ) = 2logd-I(a B AB ), (7-38) 

which follows since a A = o~ B = I/d. Hence, it suffices to show that I(cr AB ) < const.. To see 
this, note that a AB is almost identical to the information locking states considered in [137] 
(see Theorem V.l, Equation (64)), which take the form 

p AB = ah £ lyXy'Li® (um\v?) B - ( 7 - 39 ) 

i=l,...,d ' 
j=l,...,m 

Letting x and y denote random variables corresponding to the outcomes of local measure- 
ments X and Y on A and B, respectively, define I c (pab) '■= maxx^y /(x : y). Then, it is 
known that for large enough d and with high probability over the choice of local unitaries 
{Uj}, I c (pab) < const, (specifically, for our choice of m here, set the parameter e in Equa- 
tion (66) of [ ] to scale as 1/logd). Observing that cab is attainable from p AB via a 
local operation (namely, we trace out the register containing label j in A), and recalling 
that the mutual information is non-increasing under partial trace completes the proof. □ 

Proof of Theorem 7.11. The claim will follow by showing that, in Equation (7.16), S(p) < 
logm, whereas for d sufficiently large and with high probability, S(p B ) > 21ogc? — const, 
for all B. Again, the first of these bounds follows simply because the rank of p is bounded 
by m. 
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Now, let M and N denote arbitrary complete von Neumann measurements on A and 
B, respectively, such that 

M = {M x = K)K|}ti, N = {N y = \n y )(n y \} d y=1 . (7.40) 

For such a measurement M, let I1m(o") denote the completely positive trace-preserving 
linear map 



Il M (a) = J2 M ^ M =c- (7-41) 



x=l 



To prove the desired bound of S(p B ) > 21og<i — const., we use the concentration of 
measure results of [136] (see also [137]). The intuition is as follows. We first consider a 
fixed set of local measurement bases M and N. Then, one can show that for the random 
state \ip) and the corresponding state p in the statement of the theorem, the expected value 
of S(Hm <8> IIjv(p)) is at least roughly 21og<i We then convert this into a high-probability 
statement using Levy's Lemma (Lemma 7.2), which yields that with high probability, 
S(Hm <8> Iljv(p)) will indeed be close to its expected value. This was for a fixed choice of 
local measurements M and N — to extend this statement to all such measurements, we 
use the union bound together with a net argument. Specifically, we cast a 8-net over all 
choices of local measurements M and N, and apply the union bound to conclude that for 
all measurements from this net, S(I1m <8> Hn(p)) will still be close to its expected value 
with probability bounded away from 1. 

To begin, let M and iV be a fixed choice of local measurement bases. We first lower 
bound the expected value of S(JIm <S> IIjv(p)) over random choices of |^) as follows: 

d 

S(U M ^U N (p)) > 5 2 (n M ®n w (p)) = - log [Tr(\i>)(i>\{M x ®N y ®I))) 2 , (7.42) 

x,y=l 

where ^(cr) = — log(Tr(cr 2 )) is the quantum Renyi entropy of order 2, and the last equality 
follows since for rank one M x and N y , 

Tr [(M x <g> N y pf] = [Tr(M x ® N y p)f . (7.43) 

Hence, 



^[S(TIm®H-n{p))] > -logE 



d 



[Tr(\i>)(i>\(M x ®N y ®I))f 



.x,y=l 



(7.44) 



-log (d% [Tr(|^)(^|(|0)(0| ® |0)(0| ® J))] 2 ) (7.45) 
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where the first statement follows by the convexity of — log, and the second since the 
distribution of is invariant under unitaries. Now, 



<f% [P&CIVWIWIOXOL® \m\ B ®Ic))] 2 } (7.46) 

abc ® IV'XV'I A' b'c(|00)(00Ua' ® ® ^cc))] (7.47) 

d 2 Tr(E^ [^}(VUb C ® IV'XV'L^C'] (|00)(00| AA , ® |00)(00| BB , ® W)) (7-48) 

—r^——rTr([lABC:A'B'C' + ^ ABC-.A' B'C'] (|00><00| AA , <g> |00)(00| BB , <g> / cc ,)) 

m{a z m + 1) 
m + 1 



<i 2 m + 1 



(7.49) 



where the first equality uses the fact that (Tr(Ai?)) 2 = Tr[(A <g> A) (5 <S> -B)], and the third 
equality follows since E^, ® IV'X^I] is proportional to I+W for W = 

the swap gate. (One way to see the latter is to note that a = E^, [|^)(^| <8> I'^XV'I] is 
invariant under U ®U for any unitary U, and is hence a Werner state [ 14]. Werner states, 
in turn, can be written as mixtures of the projectors onto the symmetric and antisymmetric 
spaces, Il 5 = (J + W)/2 and H A = (I - W)/2, respectively. Observing that Tr(aIl A ) = 
yields the claim.) Substituting Equation (7.49) in Equation (7.45) thus yields 

^2™ I i / i \ 

E<,[S{lL M ®IL N (p))] >log 2_>21ogd-log 1 + - =: A. (7.50) 

L v /J m + 1 \ m J 

With a lower bound on the expected value of S(Hm <g) H^(p)) in hand, we would now 
like to show that with high probability, S(Hm <E> Hn{p)) indeed takes a value close to its 
expected value. To show this, we apply Levy's Lemma (Lemma 7.2), which requires an 
upper bound on the Lipschitz constant of the entropy S(ILm ® ^n(p)) ■ The latter is given 
by the proof of Lemma III. 2 of [ ], which demonstrates an upper bound on the constant 
of \/81ogd. Thus, by Levy's Lemma: 

Pr {S(U M ® U N (p)) < A - e} < exp f-^^j , (7.51) 

for some constant c > 0. This shows that the entropy is indeed large with high probability 
for a fixed choice of local measurement basis M ® N . 

To extend this to all local measurement bases M and N, suppose we had a net of T 
basis pairs M* and N f for t G [T] able to approximate the quantity S(TLm ®T1n{p)) f° r an U 
local measurement M ® N within precision 2e. Then, by Equation (7.51) and the union 
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bound, we would have: 



— — — — 



(7.52) 



implying 



- n ogd y ) • ( 7 ' 53 ) 

Thus, if such a 2e-net with small enough T exists, then we are done. Indeed, Lemma 7.12 
shows that such a 2e-net exists with 

T<«f (7.54) 



for d G 0(1). For this value of T, since we set m = [(logrf) 4 ], the probability on the 
right side of Equation (7.53) is bounded away from 1 for large enough d, completing the 
proof. □ 

In order to complete the proof of Theorem 7.11, we finally show three lemmas required 
for the net argument above. 

Lemma 7.12. For any constant e > 0, there exists a set S := {M l £g> N l \^ =l of 'T local 
measurement bases (where M l and iV* are rank one von Neumann measurements each 
acting on d-dimensional spaces) with 



id 2 

T < [ - 7"^ ) , (7.55) 



cd? 1 2 (log df 



e 2 



(for c > a constant) such that for all p G V(C d <S> C d ) and local measurements M ® N , 
there exists a M l ® N l E S such that 

\S(U M ® ILv(p)) - S(U M t ® Uw(p)) | < 2e. (7.56) 

Proof. To construct the set 5*, we embed each local measurement basis into a unitary 
matrix, and then cast a net over unitary matrices. Specifically, recall that each local 
measurement is described by an orthonormal basis M = {\bi)} d =1 . Then, by arranging the 
vectors \bi) as columns of a matrix, we obtain a d x d unitary matrix, denoted Um, which 
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rotates the standard basis to {\bi)} i=1 . By Lemma 7.13, there exists a S-net (with respect 
to the spectral norm) for U(C d ) of size 




(7.57) 



Thus, by setting 5 = e 2 /32(log d) 2 and picking T elements {M s } and T elements {N 1 }, 
we obtain our set S of local measurement bases with size T = Tq , as in the statement of 
our claim. 

We now show that S is a 2e-net. Let M and TV be arbitrary local measurements with 
corresponding unitaries Um and Un- Then, there exist Um s and Un* in the net from 
Lemma 7.13 such that || Um — Um s < S and || Un — Un* < 5. Let V denote the 
unitary mapping basis M s <g> N l to basis M ® N . 

Now, if it were true that for all p £ V(C d <g) C d ), 

\\VpV^-p\\ ti <A6, (7.58) 
then our desired 2e-net property would follow since 

\S(lL M ®IL N (p))-S(lL M t®IL N t{p))\ = 

< 

< 



< 

for large enough d (or alternatively for < e < 1). Here, the second inequality follows from 
the estimate for the Shannon entropy H(x, 1 — x) < 2y/x(l — x), and the first inequality 
uses the Fannes-Audenaert inequality [31], which states that for p, a £ V(C d ) with r : = 

llp-^lltr /2, 

|S(p)-S(cr)| <rlog(d-l) + #(r,l-r). (7.63) 

Thus, it remains to show that Equation (7.58) indeed holds. To see this, note first that 
by Lemma 4.4, 

II U M ®U n - U m * <8> U N * ^ < || U M - U M * \\ +\\U N - U N t \\ < 25, (7.64) 



\s(il m ®ii n (j>))-s(iLm 

H(25,l -25) + 25 log d 2 
(2V25 + 25)\ogd 2 



e + 



2e 



8\ogd 



U N (VpV^)\ 
(7.59) 
(7.60) 

(7.61) 

(7.62) 
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where note the columns of Um <8> Un are now elements of the local product basis M £§> N. 
We thus have 



25 > || U M ®U n - U M s <8> U N t ^ = || U M ®U N - V\U M ® U N ) L = || / - V , 

(7.65) 

where the last equality follows since the spectral norm is invariant under unitaries. But 
this implies 



\\V P V ] -p\\ tr = \\V P V ] -Vp + Vp- p\\ tr (7.66) 

< ||p(yt_j)|| tr + || ( y_ /)p || tr (7 . 67) 

< IIPlltrll^-ZlL + lly-JlLllpIL (7.68) 

< 45, (7.69) 

where the first inequality follows from the triangle inequality, the second from the fact 
that for Schatten p-norms, || ABC || < || A \\ B || || C (see Section 1.3), and the 
third inequality from Equation 7.65. This concludes the proof. 

□ 

Lemma 7.13. For any constant 5 > 0, there exists a set S := {Ut}'^ =1 of unitaries U t G 
U{C d ) with 

To < (— j , (7.70) 

(for c > a constant) and such that for all U G U(C d ), there exists a U t G S satisfying 

ll^-^tlL<*. 

Proof. For any unitary U G U(C d ), the idea is to replace the columns of U with vectors 
taken from a net on the set of pure states in C d . Of course, the resulting operator U' is in 
general not unitary — however, this can be corrected by an appropriate orthogonalization 
procedure inspired by the "pretty good measurement" [135], yielding a unitary U" such 
that || U — U" < 5, as desired. 

More specifically, by Lemma III. 6 of [ ], there exists an e-net iV on the set of pure 
state vectors in C d (with respect to the Euclidean norm) such that \N\ < (5/e) 2d . Set 
e = Now, for each column \bi) of a given unitary U, we first find an e-close vector 

\b'j) G N, and embed the latter as columns into a matrix U'. Of course, the vectors {|&'j)} 
are not orthogonal in general, so U' is not unitary. To correct this, define the operator 

b = Eti \Wt\ and let 

\K) = B- 1/2 %)- (7-71) 
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Note that if the |£^) are linearly independent, then B is invertible, and moreover {\b'J)} i=1 
is an orthonormal basis since 

d d 

\b")(b"\ =J2 B ~ 1/2 \ b 'i)( b i\ B ~ 1/2 = B-^BB- 1 ' 2 = I. (7.72) 

i=l i=l 

Note that by Lemma 7.14 below, the \b'j) are indeed linearly independent (for large enough 
d) for our choice of e G 0(c/~ 3 / 2 ). 

Now, in order to show that this construction constitutes a 5- net, we must show that 
II U ~ U" IIqq < 5. To do so, we first bound || \b) — \b") || 2 as 



\b) ~ W) || 2 < || \b) - \b>) || 2 + || \b') - \b") || 2 < e+ (/ - B-*)\V) 



< e 



I-B'2 



(7.73) 

where the second inequality follows from our e-net, and the third inequality from the 
definition of the spectral norm. To bound this latter quantity, note that since 



II \Wi\ - MM Hoc < II mm - Mbil || to < 2 || %) - \h) || 2 < 2e, 
where the second inequality follows from Equation (1.33), we have 



B-I 



EciWii-iWi 



8=1 



<Elll^|-|^llloo< 2 ^, 



(7.74) 



(7.75) 



i=i 



and consequently || B x l 2 — / || < 1 2 _ 2 £ & . The latter can be seen by applying the definition 
of the spectral norm in terms of the singular values of its argument and showing that if 

: - 1 



\x — 1| < y for i^O, then 



\b)-\b") || 2 <e + T 



< y/(l —y). We conclude that 
2de 



T + 2d)e . , 



2de ~ 1 

where the last inequality holds when 2de < 1/2. 

With this bound in hand, we can now upper bound || U — U" || as 



(7.76) 



U-U" 



- (U"Y 



max 

\x)eC d S.t. ||a:|| 2 =l 



(U j - (U") r )\x) 

< Vdm^m-ib'^l 

l<i<d 

< Vd(4d + 2)e 

< 5, 



(7.77) 
(7.78) 

(7.79) 

(7.80) 
(7.81) 
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where the first inequality follows since || x || 2 < \fd || x for |x) G C d , the second inequality 
follows from the Cauchy-Schwarz inequality and Equation (7.76), and the third inequality 
from our definition of e, as desired. 

It remains to bound the cardinality of our 5-net: The number of different U" in this 
net is at most T < (5/e) M2 = (30d 3 ^ 2 /S) 2d \ since for each of the d columns of U", we have 
at most (5/e) 2d vectors in our pure state net N to choose from. 

□ 

Lemma 7.14. Let S = {\bi)}f =1 C C d be an orthonormal basis for C d , and let S' = 
{\b'i)} d i=l C C d satisfy || |6 4 ) - %) || 2 < e for all i. Then if e G o(l/Vd), S' is a linearly 
independent set for large enough d. 

Proof. We proceed by contradiction. Assume S' is a linearly dependent set, i.e. there exist 
coefficients a,, at least two of which are non-zero, such that 

i i 

for G C d with || |e*) || 2 < e. It follows that || J2i a i\bi) ||o = II Sj a il e «) II2' implying 
E |«i| 2 < E \oti\ \aj\ |(e»|ej)| < e 2 ^ |oti| [cKj J = e 2 I ^ |«i| 2 + ^ \a>i\ \a>j\ J , (7.83) 

i ij ij \ j v£j J 

where the second inequality follows from the Cauchy-Schwarz inequality. Thus, 

(1 " e 2 ) E N| 2 < e 2 kl < e ' (E h) ^ e ^ (E l^l 2 ) ' ( 7 - 84 ) 

where the third inequality follows since || \v) < \/d\\ \v) || 2 for any |u) G C d . Since S' is 
linearly dependent, ^ |«j| 2 7^ 0, and so dividing both end sides of the chain above by this 
quantity yields 

l<e 2 (d+l), (7.85) 
which for e G o{l/\fd) yields a contradiction for large enough d. □ 
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Chapter 8 



Characterizing quantumness via 
entanglement creation 

This chapter is based on [110]: 

S. Gharibian, M. Piani, G. Adesso, J. Calsamiglia and P. Horodecki. Characterizing quan- 
tumness via entanglement creation. International Journal of Quantum Information, 9(7 & 
8):1701-1713, 2011, DOI: 10.1142/S0219749911008258, © 2011 World Scientific Publishing 
Company, www.worldscientific.com/worldscinet/ijqi. 

In Chapter 8, we introduced an activation protocol which maps general non-classical (mul- 
tipartite) correlations between given quantum systems into bipartite entanglement between 
the systems and an ancilla. Here, we study how this activation protocol can be used to en- 
tangle the starting systems themselves via entanglement swapping through a measurement 
on the ancilla. Furthermore, we bound the relative entropy of quantumness (a naturally 
arising measure of non-classicality in the scheme of Chapter 8) for a special class of sepa- 
rable states, the so-called classical-quantum states. In particular, we fully characterize the 
classical-quantum two-qubit states that are maximally non-classical. 

8.1 Introduction and results 

In this chapter, we continue our study of non-classical correlations (see Section 1.6.2 for a 
brief survey). Specifically, recall that the non-classicality of correlations present in multi- 
partite quantum states is not due solely to the presence of entanglement. Namely, there 



194 



exist quantum states which are unentangled, but nevertheless exhibit traits that have no 
counterpart in the classical world. Such traits include no-local broadcasting [209] and the 
locking of correlations [87, 78] (see Section 1.6.2). Much effort has been devoted in recent 
years to characterize and quantify the non-classicality — or quantumness — of correla- 
tions [203, 138, 187, 119, 196, 118, 217, 185, 54, 209, 188, 94, 13, 216, 231] believed to be 
behind such feats. 

In this context, we proposed an activation protocol in Chapter 7 which maps general 
non-classical (multipartite) correlations between input systems into bipartite entanglement 
between the systems and an ancilla. This was accomplished by letting the ancilla and input 
systems interact via CNOT gates with the systems acting as controls (see Section 7.3 for 
a formal description of the protocol). One advantange of this mapping is that it allows us 
to apply the tools and concepts of entanglement theory to the study of the quantumness 
of correlations. As an added bonus, the activation protocol, when considered in an adver- 
sarial context where the control bases are chosen so as to create the minimal amount of 
system-ancilla entanglement, provides an operational interpretation of the relative entropy 
of quantumness [54, 187, 118, 217, 196] as being the minimum distillable entanglement [210] 
necessarily (i.e. in the worst case scenario) created between the input systems and the an- 
cilla. 

In this chapter, we continue our study of the activation protocol of Chapter 7, and 
present two main contributions towards a better understanding of the quantumness of 
correlations. 

Our Results: Here, we show the following. 

1. Upper bounds on the non-classicality of separable states. We first give a non- 
trivial upper bound on the relative entropy of quantumness for a special class of separable 
states, the so-called classical-quantum states (see Section 1.6.2) (Lemma 8.1). Using this, 
we then fully characterize the classical-quantum two-qubit states which are maximally 
non-classical with respect to the relative entropy of quantumness (Lemma 8.2). 

2. Entangling the input systems via entanglement swapping. The activation 
protocol of Chapter 7 demonstrates how to map non-classical correlations in an initial 
quantum system into entanglement between the system and an ancilla. However, one might 
prefer not to generate entanglement with an ancilla, but rather within the subsystems of 
the initial system itself. 

We thus next study an approach for extending the activation protocol in order to 
entangle the input systems in such a manner as follows: We first run the original activation 
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protocol (i.e. we let each system interact with an ancilla). Next, we try to "swap" [152] 
the entanglement created between the input systems and ancilla back into entanglement 
among the input systems by performing a measurement on the ancilla alone. Note that 
we assume a worst-case scenario in performing this mapping: We ask, does there exist 
a choice of control bases for the activation protocol for which no entanglement can be 
created between the input systems with this approach, even if we allow post-selection after 
measuring the ancilla? 

For this mapping, we derive conditions (Theorem 8.3, Corollary 8.4, discussion in Sec- 
tions 8.4.2 and 8.4.3) under which entanglement can or cannot be swapped back into the 
input system. In particular, we find that there exist non-classical states which, despite 
necessarily leading to the creation of entanglement between systems and the ancilla in the 
activation protocol, may nevertheless fail to allow entanglement swapping back onto the 
initial system for a crafty choice of control bases. 

Discussion and open questions. In this chapter, we first find bounds on the non- 
classicality (as measured by the relative entropy of quantumness) of classical-quantum 
states, and we characterize the maximally non-classical two-qubit classical-quantum states. 
It would be interesting to find bounds on the non-classicality of general separable states: 
from Chapter 7 we know that, for example, a separable state of two qubits can never be 
as non-classical as a maximally entangled pure state, but at present we do not know how 
large the gap between the two is. We remark that the maximally non-classical two-qubit 
CQ states found here (with respect to the relative entropy of quantumness) in Lemmas 8.1 
and 8.2 match those found in Chapter 6 for the measure defined therein based on local 
unitary operations. 

With respect to the swapping of the post-activation ancilla-system entanglement onto 
the original systems, we have both necessary conditions and sufficient conditions for the 
swapping to be possible in an adversarial scenario, but we lack conditions which are si- 
multaneously necessary and sufficient. In finding such conditions, we suspect it would be 
beneficial to study the problem which arises in our swapping scheme: when is it possible 
to make a state entangled by rescaling rows and columns as in Equation (8.17)? 

Finally, most of our results (e.g. Lemma 8.1, Theorem 8.3, and Corollary 8.4) apply to 
higher dimensional systems. However, it would be nice to extend Lemma 8.2 to this more 
general setting by characterizing the maximally non-classical classical-quantum states of 
higher dimension than qubits. Unfortunately, our approach here does not seem to apply 
in a straightforward manner to this setting, and further investigation is needed. 
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Organization of chapter. We begin in Section 8.2 with definitions and background 
information. In Section 8.3, we provide bounds on non-classicality for classical-quantum 
states, as measured by the relative entropy of quantumness. In Section 8.4, we present sev- 
eral results and observations regarding entangling input systems via the activation protocol 
and entanglement swapping. 

8.2 Preliminaries 

Throughout this chapter, we continue to use the notation and definitions from Chapter 7, 
which we briefly outline now. Recall from Definition 7.1 that a strictly classically correlated 
or classical quantum state p is one which diagonalizes in a local product basis B. 

We now outline the activation protocol of Chapter 7 (see Section 7.3 for further details). 
Consider an arbitrary state p G P((C living in register A, where the ith local d- 
dimensional system lives in register Aj. We refer to A as the system. We further introduce 
a joint register A' of <i-dimensional registers A[, . . . ,A' n of n ancilla qudit registers each 
initialized to the state |0)(0| A ,, henceforth called the ancilla (see Figure 7.1). The initial 
state of the joint A : A' system is thus paa 1 = Pa® |0)(0|®r. For a given input p A , we 
first consider for each i an adversarial application of a local unitary U,i to each Aj (i.e. this 
chooses the control basis for system i), and follow by applying one CNOT gate on each 
subsystem Ai (control qudit) and the corresponding ancillary party A\ (target qudit). The 
final state of system plus ancilla at the end of this protocol is 

^ = ^®|0><0|y)Vt, (8.1) 

with V = CNOT A:A ,(U A ® I A ,), U A = <8>2=iUi, and CNOT AA , = ®" =1 CNOT MA[ . Recall 
that by Theorem 7.3, the output p AA , is separable across the A : A' split if and only if 
p A is classical. As done in Chapter 7, we henceforth refer to p A and p AA , as p and pf for 
simplicity, respectively. 

It will be useful to also recall that p* can be written as 

P f = Y,p! 3 \i)(3\ A ®\i}(j\ A , (8-2) 

ij 

where 

P%:=(b{i)\p\b{j)), (8-3) 

for \b(i)) = U A \i). In other words, pf is of the maximally correlated [212] form in the 
A : A' cut. Using this observation, we showed (Theorem 7.4) that if one quantifies the 
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minimum distillable entanglement generated across the A : A' split in this protocol, the 
corresponding measure of non-classicality we obtain is given by 



Q(p) = min (s(p B )-S(p)), 



(8.4) 



for S(p) the von Neumann entropy, where the minimization is over all local product bases B, 
and where p B := J2 i \b(i))(b(i)\p\b(i))(b(i)\. It turned out (Corollary 7.5) that Q(p) in fact 
coincides with the measure of non-classicality known as the relative entropy of quantumness 
(REQ) [54, 187, 118, 217, 196], bestowing the latter with an operational interpretation. 

8.3 Upper bounds for separable states 

In Theorem 7.6, we showed that for a bipartite state pab (where in the bipartite case 
we adopt the notational convention that A\ — A and A 2 = B), the quantity Q(pab) 
can achieve its maximum value of log d only for entangled states. We also showed that for 
increasing local dimension d, Q(pab) for certain separable pab can asymptotically approach 
\ogd. What can be said, however, in the non-asymptotic setting? In other words, for a 
fixed local dimension d, how non-classical can separable pab be? 

In this section, we first obtain a simple upper bound on Q(pab) for the subclass of 
separable states known as classical-quantum (CQ) states, that holds for arbitrary local 
dimensions. Recall from Section 1.6.2 that CQ states are those which can be written as 
Pab = Y^,=\PiV)(A ® Pi f° r {K)},-=l an orthonormal basis, {pi} a probability distribution, 
and dA and ds the local dimensions of systems A and B. We then completely characterize 
the set of maximally non-classical two-qubit CQ states with respect to the relative entropy 
of quantumness Q(pab), and show that such states achieve Q(pab) = 1/2. 

We begin with our claimed upper bound, which holds even when the local dimensions 
of A and B differ. 

Lemma 8.1. For any CQ state pab = J2i=iPi\i) (A ® Pi, one has 




(8.5) 
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Proof. We have 

Q{Pab) = mm S{p B AB ) - S{p AB ) (8.6) 

/ d A / d B \ \ 

= mms(5>ii}<ii® \^\b B m^Bmpi\bB{mbBm\\ - s{ PAB ) 

/ d A \ / d A \ 

= min f ff(p) + J>S (pf fl ) j - \H(p) + 5>£(*)J (8.7) 

d A 

= minY,P*[S{p! B )-S( Pl )}, (8.8) 

db — 7 

4 = 1 

where i?(p) denotes the Shannon entropy of the probability distribution p = {p{\ v and 
the second equality follows from choosing Ba to coincide with the basis {\i)}- Let p m : = 
maxjpj. Our strategy is to let B B project onto an eigenbasis of p m , yielding: 

Q(Pab) < J> i S (Pi") - S M] ( 8 - 9 ) 

< Y.p* s (pf B ) ( 8 - 10 ) 

< (l_ J-Wtfe, (8.11) 



d 



where the second inequality follows since S(pi) > 0, and the third inequality follows since 
Pm > l/^A and S(a B ) < log (is for any density operator a B - □ 

For a two-qubit CQ state p AB , Lemma 8.1 implies Q(pab) < 1/2. We now show that 
this bound is tight by characterizing the set of CQ states attaining Q(pab) = 1/2. 

Lemma 8.2. Consider CQ state p A B G T>(C 2 <8> C 2 ) such that pab = Y^t=iPi\^)(A ® Pi- 
Then Q(pab) — 1/2 if and only if pi = p 2 = 1/2 and p 1 = l^iXV'il an d P2 = IV^XV^I for 
some ^2) e C 2 such that \(ipi\ip 2 )\ 2 = 1/2. 

Proof. That p^s with p\ 7^ 1/2 implies Q(pab) < 1/2 follows immediately from Equa- 
tion (8.10) and the fact that < S(o~) < 1 for any 1-qubit density operator a. We thus 
henceforth assume p\ = P2 = 1/2. That pi and pi must be pure now also follows anal- 
ogously, for if, say, p\ is mixed, then we simply choose B B in Equation (8.9) to instead 
project onto an eigenbasis of P2, and use the fact that S(pi) > to achieve Q(pab) < 1/2. 
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We thus henceforth assume p\ = l^'iXV'il ari( i P2 = IV^XV^I for some IV'i),!^) £ C 2 . It 
remains to show that we must have |(V'i|V'2)| 2 = 1/2. 

Plugging pas into Equation (8.8) and noting that S(pi) = S{p2) = 0, we have 
Q(pab) = l^[S([\^)(H} BB ) + S(m(H} BB )] (8-12) 



- mm 

2 B S 



^(I(^(0)|^)|M(^(i)|^i)| 2 )+^(I(&b(0)|^)|M(6 b (i)|^)| 2 ) 



- mm 

2 IMo)> 



f |(Mo)|^i>lM(Mo)l^)| 



# l(6fl(0)|^>r, (6b(0)|^> 



(8.13) 



where (ipi\?Pi) = (V^IV^) — 0, and where the last equality follows since |6_B(j))(6s(j)| are 
rank-one projectors. Note that one can think of the last equality as effectively switching 
the roles of the measurement and the target state, so that the minimization can be thought 
of as being taken over all pure target states |&b(0)) with respect to measurements in the 
bases B 1 := {|Vi), |^>} and B * : = {IV>2>, |^»- We can now plug Equation (8.13) into 
the well-known entropic uncertainty relation of Maassen and Uffink [189, 251], which states 
that for classical distributions P c and Pd obtained by measuring pure state \ip) with respect 
to orthonormal bases C = {|c)} and T> = {\d)}, respectively, we have 

l(H(P c ) + H(P d )) > -logf(C,V), (8.14) 



where f(C,V) := max{|(c|d)| | |c) G C, \d) E V}. We thus obtain: 



AB , 



> max 

\<pi)eBi,\<t> 2 )eB 2 



log|<0l|02>| - 



(8.15) 



Note that this lower bound attains its maximum value of 1/2 if Si and £>2 are mutually 
unbiased, i.e. when KV'ilV^)! = 1/2. On the other hand, suppose £>i and £> 2 are not 
mutually unbiased, i.e. suppose without loss of generality that KV'il^)! 2 > 1/2. Then 
choosing |&b(0)) = \ip\) in Equation (8.13) yields Q(pab) < 1/2. The claim follows. □ 



Combining Lemmas 8.1 and 8.2, we obtain a characterization of the set of two-qubit CQ 
states which are deemed maximally non-classical by Q. Such states include, for example, 
the CQ state 



1 



-|0)(0|®|0)(0| + -|l)(l|(2)|+)( 



/ 1 










\ 



1 



(8.16) 



/ 
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where |+) = (|0) + \ \))/y/2. 

8.4 Swapping the ancilla-system entanglement onto 
the system 

We now explore the possibility of generating entanglement in the original system AB by 
projecting the ancilla systems A' of the state pf of (8.1) jointly onto an entangled pure 
state. In other words, we consider an entanglement swapping process [152] that maps the 
system-ancilla entanglement onto the systems AB. As we are only interested in knowing 
whether this is possible (rather than, say, in the probability of success), the filtering via 
a pure state is not restrictive and corresponds to the best possible strategy. Our results 
indicate that this feat is possible for some, but not all, separable non-classical states. 

We begin by noting that thanks to the maximally-correlated form of pf (Equation (8.2)), 
we have that the (unnormalized) final state of system AB after projecting the ancilla system 
onto (normalized) state \<p) = Yli^l^) (C d )® n is given by 



with pfj defined in Equation (8.3). Hence, the resulting (unnormalized) state p^ is simply 
the Hadamard product of the original state (represented in the B basis) and \(j>){(j>\ (rep- 
resented in the computational basis), i.e. p^ = p B o for o the Hadamard product 
defined such that (C o D)(i,j) := C(i,j)D(i,j). 

As previously mentioned, our goal is to answer the following question: For a given 
input p, is it true that for any choice of starting local bases for the CNOT gates in the 
activation protocol, there exists a state \<j>) such that p^ is entangled (across its constituent 
local ci-dimensional systems)? 

In Section 8.4.1 we provide a simple sufficient condition under which the generation of 
entanglement in the original system is always possible with an appropriate choice of \<f>), 
regardless of the choice of adversarial local unitary. We then observe that this condition 
holds for all pseudo-isotropic states as in Equation (8.19), with entangled and p > 0. 
In Sections 8.4.2 and 8.4.3, we provide examples of classical-quantum (CQ) and quantum- 
quantum (QQ) separable states, respectively, for which entanglement in AB cannot be 
generated in this fashion, i.e. there exists a choice of Uab that prevents the generation 
of entanglement in AB via the swapping of system-ancilla entanglement, even if there is 




(8.17) 
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necessarily entanglement between the system-ancilla cut after the activation protocol is 
run. 



8.4.1 Sufficient condition for entanglement swapping 

We focus again on the bipartite case A\ = A, A 2 = B. We have the following simple 
condition which ensures the swapping of entanglement is possible. 

Theorem 8.3. If for any choice of local basis B, there exists a non-zero off-diagonal 
element of an off-diagonal block of p B AB , i.e. if for all B = BaBb there exists a choice of 
i 7^ j and k ^ I such that (&,i(z)&b(A;)|pab|&aC7)&b(0) 7^ 0; then it is possible to swap 
entanglement back into the input systems (regardless of the choice ofB), i.e. there exists a 
\4>) such that p^ is entangled. 

Proof. The strategy of the proof is to choose \<p) so that the result of the Hadamard product 
in Equation (8.17) is non-positive under partial transposition (NPT) [ 36, 146]. Fix any 
choice of local basis B. By assumption, we know there exist indices % ^ j and k ^ I such 
that (bA(i)bB(k)\pAB\bA(j)bB(l)) ^ 0. In order to ensure that p^ is NPT, we thus choose 
|0) to single out these non-zero off-diagonal terms by setting 

|0) = _L(|iA;> + b7>)- (8.18) 

With this choice of p^ becomes a Hermitian matrix with only four non-zero entries, 
two of which lie on the diagonal at positions <8> \k)(k\ and <S> an d two of 

which lie at off-diagonal positions of off-diagonal blocks at <8> \k)(l\ and <8> \l)(k\ 
(i.e. the four entries form the four corners of a square). It follows that the partial transpose 
of Ptf, is not positive. □ 

Corollary 8.4. For any 

pU>,p)ab ■= (1 -p)^ +p\1>Mab> ( 8 - 19 ) 

with Iab/D the maximally mixed state for AB and D the dimension of AB, if is 
entangled and p > 0, then there exists a choice of \<p) such that p$ is entangled. 

Proof. Since the maximally mixed component of (8.19) is diagonal with respect to any 
choice of local bases, it suffices to argue that satisfies the condition of Theorem 1. This 
easily follows from the fact is entangled, and thus has, up to local unitaries, a Schmidt 
decomposition J^t^Q 1 V^k\k)\k), with \ > Ai > 0. □ 
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Corollary 8.4 shows that for any value of p > 0, entanglement can be transferred 
to the original system for the pseudo-isotropic state p(p,ijj) of Equation (8.19), even for 
values of p which correspond to separable states (recall that for p small enough, the state 
p(p,i/j) is separable due to the existence of a separable ball around the maximally mixed 
state [153, 124]). We remark that for all p > and entangled \if)), p(p, ifr) is known to be 
non-classical [ ], and so here the non-classicality of the starting state allows us to create 
entanglement in the original systems AB by applying the activation protocol followed by 
our entanglement swapping procedure. 

8.4.2 Classical-quantum separable states 

In Section 8.4.1, we demonstrated that for certain non-classically correlated states, entan- 
glement can be mapped back into the original system after the activation protocol is run. 
Can this be achieved with any type of non-classically correlated input p? We now show 
that the answer is no — there exist separable non-classical p such that, while entanglement 
is always generated in the activation protocol between systems and ancilla independently 
of the local unitaries Ua and Ub, a proper adversarial choice of local unitaries Ua and Ub 
can nevertheless prevent entanglement from being mapped back to the system. 

Consider the separable non-classical CQ state of Equation (8.16). By Equation (8.17), 
note that when the adversarial local unitaries are chosen as Ua = Ub = I, we have 



Since p is block diagonal, it hence follows that p^ is block diagonal, since the Hadamard 
product cannot change this block diagonal structure regardless of the choice of \<fi). We 
conclude that there exists a choice of local bases (i.e the computational basis) with respect 
to which pip is always separable for all i.e. it is not possible to project the (necessarily 
present) system-ancilae entanglement generated in the activation protocol back onto the 
system. In fact, this proof approach holds for any CQ (or QC) state that is not strictly 
classically correlated, implying that for such states, there is a choice of local unitaries 
for which, even if entanglement is created between system and ancilla in the activation 
protocol, such entanglement cannot be swapped back into the input system. 

8.4.3 Quantum- quantum separable states 

Based on the results in Section 8.4.2, one might hope that entanglement generation in 
separable starting systems is possible if p is not CQ nor QC (i.e. p is what we might call 




(8.20) 
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QQ separable). We provide a counterexample to this conjecture here — namely, we show 
that there exist QQ separable states for which an adversarial choice of local bases in the 
activation protocol prevents the swapping of ancilla-system entanglement back into the 
input systems. 

To do so, consider the separable QQ operator: 



PAB = ^ l°X°l 



I+X+I + gl+X+l® l°><°l 
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(8.21) 



To prove our claim, as in Section 8.4.2, we choose local adversarial unitaries U \ = Ub = I 
and show that pf = p AB o \4>){<j>\ is separable for any choice of The latter is shown 
by first deriving a condition under which the eigenvalues of Hermitian operators with a 
structure similar to p remain invariant under partial transposition. We then show that 
p fulfills this condition for any choice of \<p), implying p always remains separable, since 
the partial transpose is a necessary and sufficient condition for separability of two-qubit 
states [146]. 

Lemma 8.5. Given any Hermitian operator X acting on C 2 ®C 2 with off-diagonal blocks 
which are diagonal, i.e. 
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x= i 2 r ^ , (8.22) 



if either a^aj^ 
transposition. 



or ai3a24 ^ ^> then the spectrum of A is invariant under partial 



Proof. Let pxW an d Px r W denote the characteristic polynomials of X and X r , the 
partial transpose of X, respectively. Then 

Px(A) -Px r W = 2Re(ai2a3 4 ai 3 a24 - a^^ais^) = 4 Im(a^ 3 a 2 4) lm(a 12 a* u ), (8.23) 

where Re(x) (Im(a;)) denotes the real (imaginary) part of x. The claim follows for 012034 e 
K. An analogous calculation yields the a^a^ G M. case. □ 



With Lemma 8.5 in hand, it is easy to see that p> has a positive partial transpose (and 
is hence separable) for all \<p) — specifically, we observe that p satisfies the conditions of 
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Lemma 8.5 since 012034 = (1/4) (0) = 0, and this in particular holds even after taking the 
Hadamard product with any \(f))((f)\. Since p is positive semidefmite, it thus follows from 
Lemma 8.5 that pf must also be positive semidefmite under partial transposition and hence 
separable. Thus, there exist QQ separable states for which system-ancilla entanglement 
cannot be mapped back to the system. 

Theorem 8.3 tells us that if a two-qubit state p has off-diagonal terms on its off-diagonal 
blocks for any choice of local bases, then entanglement can be created among the systems 
via swapping. On the other hand, if p is restricted to having off-diagonal blocks which are 
diagonal, as was seen with the CQ and QQ counterexamples considered in Sections 8.4.2 
and 8.4.3, then there are choices of local initial rotations such that entanglement generation 
among the systems is not necessarily possible (actually, in the CQ case, entanglement 
generation is not possible for any choice of local initial rotations). 

One could ask whether this "diagonal off-diagonal" block structure is sufficient to rule 
out the possibility of entanglement generation. The answer is negative. Consider the 
following (un-normalized) positive semidefmite operator which has diagonal off-diagonal 
blocks: 
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(8.24) 



It turns out that the partial transposition of p has a negative eigenvalue (observe that 
p thus also necessarily violates the conditions of Lemma 8.5). Hence, despite the fact 
that p has off-diagonal blocks which are diagonal, it is nevertheless entangled, implying 
entanglement transfer to the system is possible for any choice of local bases: indeed, the 
Hadamard product can be chosen to be trivial, so that the projection simply gives back (a 
locally rotated and unnormalized) pa- 
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Chapter 9 
Conclusion 



In this thesis, we have studied three areas in quantum computation and information: the 
approximability of quantum problems, quantum proof systems, and non-classical correla- 
tions. Our results in each of these areas are summarized as follows. 

With respect to approximation, we have completed some of the first works in an area 
aiming to understand the computational complexity of efficiently and rigorously comput- 
ing approximate solutions to problems which are complete for quantum complexity classes. 
In Chapter 2, we demonstrated a polynomial time approximation algorithm for dense in- 
stances of the canonical QMA-complete problem, the local Hamiltonian problem. This 
required the derivation of a lower bound on the approximation ratio achievable by prod- 
uct state assignments, which as discussed in Chapter 2, can be seen as negative progress 
towards a sought-after quantum PCP theorem. Among other open questions discussed 
therein, perhaps the most natural direction here is the pursuit of further new approxi- 
mation algorithms for problems complete for QMA, the quantum generalization of NP. 
In Chapter 3, we then proceeded in the opposite direction by demonstrating hardness of 
approximation results for a new quantum complexity class we defined, cq-£2- This class is 
an arguably natural generalization of Eg, and the hard-to-approximate problems for cq-£2 
we considered are generalizations of classical covering problems obtained via the notion 
of quantum constraint satisfaction (i.e. local Hamiltonian constraints). Aside from the 
obvious open questions here regarding further hardness of approximation results such as 
a quantum PCP theorem, we would be interested to know to what extent the class cq-£2 
itself may play an important role in quantum complexity theory, just as has proven 
valuable in the classical setting. 

With respect to quantum proof systems, in Chapter 4 we focused on the question of 
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whether multiple unentangled provers can be simulated by a single prover in the context 
of QMA proof systems. As the question of whether two provers are as good as one (i.e. is 
QMA = QMA(2)?) remains open despite much effort, we focused our attention on variants 
of QMA in which the verification protocol is suitably restricted. In this setting, we showed 
various results, including a collapse to QMA for a restricted variant of QMA(poly), and an 
alternate proof of a parallel repetition theorem for SepQMA(2). Understanding the non- 
trivial "power of unentanglement" [1 l] between quantum provers remains a challenging 
and interesting direction of work. 

Finally, with respect to non-classical correlations, in Chapter 5, we first motivated the 
study of such correlations by examining their role in the DQC1 trace estimation algorithm, 
as well as the quantum communication task of locking. Above all, understanding the precise 
role such correlations play in mixed-state quantum computing remains an important open 
question. In Chapter 6, we then proposed a novel scheme for quantifying non-classical 
correlations based on a special class of local unitary operations. This raised the question 
as to how the notions of "disturbance under measurement" and "disturbance under unitary 
operations" differ in their characterization of non-classical correlations. Finally, Chapters 7 
and 8 introduced and studied a protocol which "activates" non-classical correlations present 
in a multipartite quantum system into entanglement between the system and an ancilla. 
Aside from yielding a new framework through which new non-classicality measures can be 
discovered, our study here also revealed a surprising result: That mixedness in quantum 
states can play a very important role in giving rise to non-classical correlations, both for 
separable and entangled states. We would be interested to see how this framework may 
be further developed, and moreover whether the ideas behind it may prove useful in a 
quantum computational setting. 

In conclusion, the field of quantum computation and information is, after over two 
decades of study, arguably no longer in its infancy. With a solid theoretical base and 
formalism in place, including the quantum circuit model, quantum complexity classes and 
proof systems, and foundations for quantum information theory, the field now covers a 
large number of areas of study, of which our focus here is but a small part. Yet, whether 
quantum computers will, at a practical level, indeed be the wave of the future, is in our 
opinion not yet entirely clear. What is clear, however, is that no matter the outcome, the 
lessons learned through this line of work have already taught us much about the physical 
world around us. Indeed, the study of this field has united the physics and computer 
science communities towards a common ultimate goal: To probe the physical limits of 
nature and computing themselves. This in itself is no small feat. As it stands, information 
is physical. We would not (and could not) have it any other way. 
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